SEQUENCE LISTING 



<110> Farnet, Chris 

Zazopoulos, Emmanuel 
Staffa, Alfredo 

<120> Genes and Proteins for Biosynthesis of Ramoplanin 

<130> 3019-PCT 

<160> 34 

<170> Patentin version 3.0 

<210> 1 

<211> 88421 

<212> DNA 

<213> Actinoplanes sp. 
<220> 

<221> misc_feature 

<222> (2077) . . (3078) 

<223> ORF 1; positive strandedness 

<220> 

<221> misc_feature 

<222> (3118) . , (4032) 

<223> ORF 2; positive strandedness 

<220> 

<221> misc_feature 

<222> (4038) . . (5048) 

<223> ORF 3; positive strandedness 

<220> 

<221> misc_feature 

<222> (4038) , . (5048) 

<223> ORF 3; positive strandedness 
<220> 

<221> misc_feature 

<222> (6665) . . (5814) 

<223> ORF 4; negative strandedness 

<220> 

<221> misc_feature 

<222> (7703) . . (6693) 

<22 3> ORF 5; negative strandedness 

<22 0> 

<221> misc_feature 

<222> (9464) . . (8130) 

<223> ORF 6; negative strandedness 
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<220> 

<221> iriisc_f eature 

<222> (9691) . . (10761) 

<223> ORF 7; positive strandedness 



<220> 

<221> misc_feature 

<222> (12751) . . (10829) 

<223> ORF 8; negative strandedness 



<220> 

<221> misc_f eature 

<222> (13617) . . (12802) 

<223> ORF 9; negative strandedness 



<220> 

<221> misc_feature 

<222> (15203) . . (13614) 

<223> ORF 10; negative strandedness 



<220> 

<221> misc_feature 

<222> (15591) . . (15863) 

<223> ORF 11; positive strandedness 



<220> 

<221> misc^feature 

<222> (15880) . . (19035) 

<223> ORF 12; positive strandedness 



<220> 

<221> raisc_f eature 

<222> (19032) . . (39713) 

<223> ORF 13; positive strandedness 



<220> 

<221> misc_f eature 

<222> (39713) . . (65800) 

<22 3> ORF 14; positive strandedness 



<220> 

<221> misc__f eature 

<222> (65826) . . (66530) 

<223> ORF 15; positive strandedness 

<220> 

<221> inisc^f eature 

<222> (66546) . . (67370) 

<223> ORF 16; positive strandedness 



<220> 



<221> 
<222> 
<223> 



misc_f eature 
(67384) . . (70059) 
ORF 17; positive strandedness 



<220> 

<221> misc_f eature 

<222> (70099) . . (70662) 

<223> ORF 18; positive strandedness 



<220> 

<221> misc„feature 

<222> (70659) . . (71906) 

<223> ORF 19; positive strandedness 



<220> 

<221> rnisc_f eature 

<222> (73439) . . (71964) 

<223> ORF 20; negative strandedness 



<220> 

<221> misc„f eature 

<222> (74216) . . (73563) 

<223> ORF 21; negative strandedness 



<220> 

<221> misc__f eature 

<222> (75424) . . (74213) 

<223> ORF 22; negative strandedness 



<220> 

<221> misc_feature 

<222> (75535) . , (76464) 

<223> ORF 23; positive strandedness 

<220> 

<221> misc_f eature 

<222> (78110) . . (76449) 

<223> ORF 24; negative strandedness 

<220> 

<221> inisc_f eature 

<222> (79864) . . (78107) 

<2 2 3> ORF 25; negative strandedness 

<220> 

<221> misc_feature 

<222> (81624) . . (79861) 

<223> ORF 26; negative strandedness 

<220> 

<221> misc_f eature 
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<222> 
<223> 



(81909) . , (81682) 
ORF 27; negative strandedness 



<220> 

<221> misc„f eature 

<222> (82346) . . (82062) 

<223> ORF 28; negative strandedness 



<220> 

<221> misc_f eature 

<222> (82587) . . (84446) 

<223> ORF 29; positive strandedness 



<220> 

<221> misc_f eature 

<222> (84481) . . (85548) 

<223> ORF 30; positive strandedness 



<220> 

<221> misc_feature 

<222> (85556) . . (86845) 

<223> ORF 31; positive strandedness 



<220> 

<221> misc_f eature 

<222> (87372) . . (86803) 

<223> ORF 32; positive strandedness 



<220> 

<221> misc_f eature 

<222> (87494) . . (88420) 

<223> ORF 33; positive strandedness; N- terminus only 



<400> 



ggcgaactgc 


ttgtcctcgc 


tcggcggcag 


gctgttccac 


ctgtccttct 


cggccatcgg 


60 


cacgatcacc 


tcgttgaaga 


gggggttgcc 


gagccgggag 


acctgcacct 


gcgggccgac 


120 


ggtcacatcg 


ccggaggagg 


agccgtcgcg 


gacctgaacc 


tgacggcgac 


tggccgaggt 


180 


ccacaccccg 


atgaccgcgc 


ccgcgtcacg 


tccgcgcacc 


cgcttcttgc 


cgtcacggcg 


240 


caccatgtgt 


^cggggatct 


gcagcgcgat 


gctgtgcacg 


ttggtcttgt 


cggtggcgtt 


300 


gaccgccttg 


ccggcgtagt 


tgaacaggtt 


ctgcccgacc 


aggtgcttgt 


cctggaacgg 


360 


gcgcagcgtg 


ccgaggtcga 


agatggcgcc 


cagatcgacg 


aagaaggcgt 


cggcgcgctg 


420 


gccggcgaag 


accttctcgc 


cggtcttcag 


cttgtgcacg 


gcgtcggcgg 


cgaggccgtc 


480 


gtagtcggcg 


atcgacacct 


tgccgacgtt 


gcacggcggg 


cagggcagct 


tgctcgccag 


540 


caccgtgctc 


ttgccgtgct 


tgtcgacctt 


cgtcaccgag 


tagaactggc 


ggcgattcca 


600 
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gttctcgctg tcgagcgact cgatcgggcc ggtgttgtag aggaacgtct tgttgttgcg 660 

cagctcggtg cggaaccgga actggtaggt gatctcggcg cgggcgtcac cgtccgagtc 720 

gatgtggatc tcgtagacga cgtcgtcgcc gaactcgaag aagttcgggc cgctcgccgg 780 

cagctgcaac ggcacgtagt tggcgatcag ggtgaccgtg tcgggctcgt cggggctgac 84 0 

gaaggcgtac aggtcggagc tgtcggccac cgggtccttg ctgatctcgg gtgcttcgcg 900 

gtgtgaagac atgtcatgcc ctcccggcgg ccttgatcag gctcgcgacg agcttcttgt 960 

cggtgatcgt cttgagctgc tcgcccgaga agacgtcgat cgtgcccttc ttggcgtccc 1020 

ggatggagac gacgatcggg gagccgtcac cggtgggctc gttctcgccg gcgaggctca 1080 

gcccggcgac cgaggcggcc gatacgccga cgccggtcgc ggcgagggcg aggacccggc 1140 

gacgggaaag ccacggacgt gctgccgggg cgcgatagcg agtgcgactc atgctcgacc 12 00 

tttcgtcttt cttacggtcg gtggtgcgag gactggaacg ggcgttgcgt gtcgacgccc 12 60 

gtcagtacgt gatccaccgg cctgcggttc aaagacgaat tggagcgctc ccaacgggac 1320 

gactcgagcc gcgccggggc gggcacggca ccggccgtgc ccgccccgcg tgcgcggtca 1380 

ccgccagttg gcgtgatccg cggcggcggc gcggtcgcgc tcgtcggcgg cccggcggcc 144 0 

cttgtcgacc tcgcgcatcg accagccgat cgcgacggcg gccaggtagg cgagggtgcc 1500 

gagcaccgcc acggcccgga agccgccggt cacccagcag gccaggaaca cggccaggcc 1560 

accggcgccg atgcccgccg cccacgagga cggcagccgg cggatcgccg agccgaccag 1620 

cacggccagg cccagcgacg cgatcgggct cggctcctgc ggcaggtcgg cgaagtggga 1680 

gacgagcacg gcgatcaggc cgaggcccac gcccgccgcc gcggtcatcg ccgggttgat 1740 

ccggcgggcc agggccaggc cggtcatcag cacggcgacg atcgtgtcga agaccgcgta 18 0 0 

accggcgccc cagctgtcgg agagcatgta gacggtgaac gcgatgccgg ccagcgcgat 18 60 

cagcccgaac accacgtcga aggcgacacc ggcccgcggg gccggccttg ctgaagtagt 1920 

catgccgccc acgctaggaa atcggcccgt ccggcgaacc ggtcgaaagt acggtcggcg 1980 

ccgcggcgaa ccctgctttg gaccgatgct cgcgacgggg tgcttccccg agggtgatgg 2 040 

catgcgtttc taccacgagg tgcgccgatg atctggatga gctggcgcca gttccgctgg 2100 

caggccctgg ccggtgccgt cgccctggtg ccgttggtgg cctacttgat cgtcacgagc 2160 

ctggacatcc ggcgcgccca cgaccgctat caggcgcagt gcgcgtccat cggcaactgc 2220 

gccgaggcga tgctccagtt ccagaacgac ttccgcaccc gcctgctgct gctcgccatc 2280 

ctgctggccg cgatccccgg catcctcggg gtgttctggg gcgcgccgct ggtggcccgc 2340 

gagctcgaga ccggcacgca ccgcctggtc tggaaccaga gcgtcacccg gcgccggtgg 2400 

ctggcggtca aggtgctgtt cgtcggtgtc gccgcgatgg ccgtggccac gctcgtcagc 2460 



acgctgctga 


cc tgggcgag 


cagcccggtc 


gacgcggtgt 


cgcaggaccg 


gttcggcgcg 


2520 


ctggtgttcg 


acgcccgcaa 


catcgtgccg 


gtcgcgtacg 


ccgccttcgc 


cctcgtcctc 


2580 


ggcacggtga 


tcggcctgct 


cgtgcgccgc 


accatcccgg 


ccatggcgct 


caccatgctc 


2640 


gtcttcgccg 


tcgtgcagtt 


caccgtgccg 


gcgctggccc 


ggccgcacct 


gatggcgccg 


2700 


gagacccaga 


cccggcagat 


gacgttgcag 


gagttcggcg 


aggtgcgcgg 


cttcggcgac 


2760 


gagcccacgg 


tcaacgggct 


gagcatccgg 


ggcgcgtggg 


tgaccagcac 


cagcccgctg 


2820 


ctcaccgccg 


acgggacccg 


gctcgacaag 


gccacgtacc 


gcaaatgcgt 


gaccgacccc 


2880 


ccggccgtct 


cgggcggagc 


tcccggcgtc 


ggcggcaccg 


tcgcctgcct 


ggccgacctc 


2940 


gatctgcacg 


tcgaggtggc 


ctaccagccc 


aacgaccggt 


actggacctt 


ccagtggatc 


3000 


gagtcggccc 


tctacctggc 


gctcggtgga 


ctgctcctcg 


ccgtgggcct 


gtggcgcatc 


3060 


cgccgccacg 


tcatctgatc 


gtccccccgt 


ccgcacggat 


tcgaaggata 


gaaagacatg 


3120 


ccacacgagg 


attcctcgcc 


cgttctgcag 


gcggagggct 


tgaccaaacg 


ctacggtcgg 


3180 


cgcaccgccc 


tgcaggactg 


caacctgacc 


attccgcgcg 


gccgggtgat 


cggcctggtc 


3240 


ggcccgaacg 


gcgccggcaa 


gtcgacgctg 


ctccagctgg 


cctgcgggct 


gatcacgccg 


3300 


tcggagggct 


cgctgcgcgt 


gctcggcgag 


acgccggccg 


cgaacgccgg 


ccacctcgcc 


3360 


aaggtcggct 


tcgtcgcaca 


ggacaccccg 


gtctacagca 


acttcacggt 


cggcgaccac 


3420 


ctgaagatgg 


gtgccaagct 


caacccgacg 


tgggaccagg 


cgctcgccga 


gcgccgcgtc 


3480 


gcgcaggtcg 


ggctcaacca 


cggccagaag 


gcgggccggc 


tctccggcgg 


tcagcgcgcc 


3540 


cagctcgccc 


tgacgcttgc 


cgccgccaag 


cgcccggaac 


tgctgatgtt 


cgacgagccg 


3600 


gccgccgcgc 


tcgacccgct 


ggcccgcgac 


ggcttcctgc 


agaacctgct 


cgagttcgtc 


3660 


accgagctcg 


acgccagcgc 


gatcctgtcg 


tcgcacctgc 


tcggcgacgt 


cgagcgcgtc 


3720 


tgcaactacc 


tgatcgtgct 


ctgcgcctcc 


cgggtgcagg 


tcgccggcga 


cgttcccgac 


3780 


ctgctcaaca 


cgcactaccg 


catcgtcgcg 


ccccgcggcg 


agctggacca 


tccgccggcc 


3840 


ggcctcgagg 


tcatccgggc 


gcagcacgcc 


gaccggtaca 


ccaccgccgt 


cgtgcgcggc 


3900 


gacggcagcc 


ggccgagcac 


ctggacgatc 


gagcccatcc 


agctcgagga 


gctcgtgctg 


3960 


gcgtacatga 


cgcgggcgat 


gggcgtcacc 


ggcgagccgc 


tgatggccgc 


gtccggggag 


4020 


gtcgtccgtt 


gatctggatg 


agctggcggc 


agttccgcgg 


tcaggccgtc 


gtcggggtcg 


4080 


tcgtgctggc 


cctgctcgcc 


gcatacctgg 


tctacctcgg 


cgtcgacatc 


cgcggcgcct 


4140 


acgacgacta 


tcgggcgcag 


tgccccgcgg 


gcggcgactg 


cgccgggccc 


ctgggccagt 


4200 


tcagcctcga 


ctacgagaac 


acgttgctct 


atctggccgg 


cgtgctggcg 


ctggtgcccg 


4260 



gcctgctcgg catgttctgg ggcgcgcccc tgatcacccg ggagctggag aacggcaccc 4320 

agcgcctggt gtggaaccag agcgtgaccc gccgccgatg gctgctgatc aagctactcg 4380 

tcgtgggctt ggcctgcatg gtggtggccg gggtgccgag cctgctgctg acctgggccg 4440 

ccgcgccggt cgacaatgtg gccgacaacc ggttcagcac ggtgatgttc ggagcccggt 4500 

tcctgccgcc gatcgcctac gccgccttcg cgttcgtgct cggcacgctc atcggcctgc 4560 

tggtccgccg gacggtgccg gcgatggcgc tcacgctcgt ggcgttcgtg atcttccagt 4620 

tcctggtgcc gaacctggtg cgcccccacc tcatgccggc caagcacctg gtcaagccga 4680 

tgacggtgag cgccatcaac gaggccaagt cgctgggcag catcaccggc gcgccggtgc 4740 

tgaacggcct gtcgatctcg cagggctgga tcaccgacgt cagcgcgctc aagaccgccg 4800 

acggccggtc gctggacgcg aagacgttcg acaactgcta catgaacgcg cccaagaccg 4860 

gtgcgaccga gggcccgtac ggtgacgtcg cggtctgcct ggccaagctg gacctgcacg 4920 

tcgacatcgc ctaccagccg tggaaccggt actgggcctt ccagttcctc gaatcggggt 4980 

tctatgtgct gctcagcggc ctgctgatcg gcgccgcggt gtggcgcgtc cagcggcggc 5040 

ccagctgaga tgagcgcgca cagcaccgtg accgagccgg gcgcactcgt ccgtggcacc 5100 

cgggaagccg ggggcggcgg cgagcgtcgt gccgcggtgg ccgccctggt catcggcggg 5160 

gcggcggtgc ggtaccgcac ggtgctggtg gcgctgggaa gcccgccgac cgacagcgac 522 0 

gaagcagttc tcgggctggt ctccctgcac atcgcgcagg gacgcgacgc accggtccat 52 80 

ctgtacggcc aggactacat gggcatgctc gaggcccacc tggccgcacc gctggtccgg 5340 

gcgttcggcg ccggggtggt gccggtccgt gcgccgctgc tcgtcctgtt cgcgctgttc 5400 

ctggtcgtca tgtaccagct cacccggaga ctgtggtcgc cgggcgtcgc ggtggcgacg 5460 

gtgctggtgc tcggtctcgg cgccgaccgc gtgctgagcg ccgagttgac ggccggcggc 5520 

gggtacccgg agatcgccgt tctcggcgcg ctcctgttcc tcctcgcggt gcggatcggc 5580 

cgcggggagg tccgccgccc ggtgctcgcc ctggccgaat tcgggcttgc gctgggcctc 5640 

ggcgtgtcga tggtggccaa cctctgcccg ggcgaccgct gcggcactac gagtttctgg 5700 

tggccgccga tctgtctcgc gctgctcgcg gtgacggccg ggccggcggt gcgtgcgctg 5760 

cggcgcgccg ttccggagcc ggcggcgatc cggtcgcgcc cgccgacgcg cggtcagggg 5820 

ctggcggcga tgcggtcgcg cccgtcgacg ccgcgctgga gcatgccggt gagcgcggcg 5 880 

agcgcctccg cgcggcccgg atgggcctcc agctcgcgca gcgccgagat ggtcgccgcc 5940 

agctcggcgg acaggtcgtc gagcacgtcg gcgaccgcgc cggcgttggc ggccaggatc 6 000 

tccgtccaca gtgccgcgcg accgcccgcg atccgcgtgg tgtcccgcac gccctggccg 6060 

gccaggccca getgggccgg ggtgccgtcc agcatccgcg cggccagcag gccggccacc 6120 



aggtgcggca 


cgtgcgagac 


gagggccacc 


gcccggtcgt 


gctcctcggc 


gctcatcagc 


6180 


accggcgtgg 


cgccgcaggc 


cgaaaccacg 


gccagggccc 


cgtcgacggc 


ggcggtgctg 


6240 


cttcgccggc 


cgggggagag 


cacccagggc 


cgcccctcga 


acagcgagcc 


ccgggccgcg 


6300 


tgcggtccgg 


agcgttcgct 


gccggcgagg 


ggatgcccgc 


cgacgtggga 


ggaggcgtcg 


6360 


cagccgagca 


cctcgatctg 


ccgggagggc 


agcactttca 


cgctcgccgc 


atcggtgtgc 


6420 


acccgggccg 


ttccccgccg 


ctggaggtcg 


gccagcacgg 


gagcgacggc 


cgccggcggc 


6480 


accgcgatca 


cggcgacgtc 


cacccgggtg 


cgcggctcgc 


ccgcgatgcc 


cgcgccgagc 


6540 


gccgccgcgg 


cgcgcgcggc 


ggcgggatcg 


gcgtccagca 


ggtgaacggt 


gatgtcacgt 


6600 


tgggtgagag 


cgaggccgac 


cgaggtcccg 


atcagcccgg 


tgccgacgac 


gaccgcgctg 


6660 


cgcacaaggg 


ctcccatcgc 


cgcattcgct 


gttcaggccg 


cgttcttctc 


gagtgtttcg 


6720 


gcgaattcca 


accactcttg 


tgaacaacgc 


cgggccacgt 


tcgccaggac 


gtacgtgcaa 


6780 


tgcgacggaa 


ccgcgtcgag 


catgtctttc 


cactcatcat 


cttgaaccgt 


gcgggcaaga 


6840 


acccatcgca 


aaagattgcg 


gccggattcg 


gtaaatcgca 


gcgacgggtc 


gttcttcaag 


6900 


ccttgaagta 


ttgaccgcac 


ggtcggctcg 


agcagccgaa 


tcgtgttttc 


cttgtcgaag 


6960 


gaaatgtcgt 


cgcggtgacc 


gcccgtgcgg 


cgggggccgg 


gcagcgggtc 


ctcgccgcgc 


7020 


tgcaggcgat 


tgcgcacgtc 


acgcgcggtc 


gagggcgata 


cgccggcgtt 


cttggcgatc 


7080 


tcccgcaggg 


aggcgtcggg 


ctgttgctga 


atgtagctga 


cggccttcag 


tcggccctcg 


7140 


gaattgtcca 


gcgggcgcac 


ccggccgtcc 


cggccgacgc 


gggtgcgcga 


gccctgcccg 


7200 


atgtcgccgg 


agagctcgag 


gcgccggcgt 


atgttgccca 


cggtgcgggc 


gctgagtccg 


7260 


ctggaggccg 


ctattgtgcg 


gtctgaccag 


gacggatgcg 


actcgatgat 


ccgttccgcg 


7320 


gcgcgcgtgc 


ggtccgctgt 


ggacagtggc 


aggccgtgcg 


cgatgttcgc 


cttgacgccg 


7380 


aggacgaatg 


cctcctgctc 


gctgccctcg 


aacatcgcgg 


ccttgatcag 


ctcgtcgccg 


7440 


cgcagccggg 


ccgcgcccag 


ccggtgcgcg 


ccgtcgatga 


cccgcatgct 


tgcgcggtgc 


7500 


acgatgatcg 


gcggcagttc 


ggcgtcgagg 


ctggccagca 


tctgggtgtg 


ccgcggatcc 


7560 


tcgccggcca 


gccggggtga 


atcgagcatt 


tccagtgaat 


ccacaggtag 


ccactcgacg 


7620 


cgcatcattg 


acggagtgtt 


catctgtgtt 


tcgtcgtgcc 


gatccggctc 


gtggcgcgct 


7680 


gacgcgatgt 


gcaatgac tc 


cataccaacc 


cccgaatggc 


tggactccga 


tgaccgctgg 


7740 


acacgatgca 


ccgcaggtct 


caacgagatg 


ctttgacgac 


cgtcggccgg 


ctcggccgtc 


7800 


gtggccgagt 


gcgcttgtgc 


agcgggcttt 


caatgtccga 


cactaatcga 


ggcggcgctg 


7860 


gctcgccaag 


ctcgtaagag 


gttctctgac 


gcatttttaa 


gctgtcttca 


cggtgaagtt 


7920 



tatcgatccc acggtgggtc ggtgggcgtc 
cgtgtccgtg ccgtfccgtac tggtcaaccg 
ccggccgggt ccgggccgcc ccggccgagg 
caagttgcag actgtgcgat tcgggtgggt 
atgaagcccg ccagccgctt gacaccctcg 
gacaggcgga tgccctgttc gccgccgccg 
atcaccccgt ggtcctcggc gcttcgggac 
ttcacggcga ggaagaagcc gccgctcggg 
aaggacgccg gaaggtgctc ctcgagccgg 
tgcgccgagc tctccgcact cgcctcggcc 
gccacggcgg cctggctcag cggcgaggtg 
agctcgtcgg cgagcaggcc ggcgccgccc 
C3 gcgaagccca cgcgcgcgcc cggaaagagc 

^:f\ tgccggcccg ggtcgagggc cttgagggag 

..^]! tacgggctgt cctcgatcac cagcaggccg 

Is3 cgggcctcca gcggcatcgt ggcgcccgac 

y3 gcgcggggac gccggccccg gctcagctcc 

J^, tggaagccgt cctcgcgctc ggcgacgggc 

i-r-i! ccggtgatgc cgacgtagca cgggctcgcc 

Ifl agcgcgcgca gcgccagcag catcgcctcc 

r? ggagccacgt cgatgccctc gtcgcgccgc 

ccggccgccg ggccgtactg gaagacggcg 
ccggcgaggt gctcgagata gccgcggatg 
ccgtcgaacg ggcggccggg cgcgaacgag 
ttgaggaagt tcatggtgtc cagcagcggg 
cgcaggacgc ggccggtgcc cggagcctcg 
gtcgcgaagc cctgactgtg gcaccgcatt 
accgcacacc tgtgcaactg ccgcgaacgc 
gtggacggtt cggccgacgt ccatagtggt 
cctccaggcc gccagcgaaa ggacgccgcc 
ctcgcggact acgaacagct ggcccaagcg 



gagcgggatc tgatcgccac gatgcgtcat 7980 

ggagatctcg ggggtgccac ggtggacgtg 8040 

gcgccgggga cgggttcggc cctatttctg 8100 

cacggccgga gtgccgcgat ctcggtcgtg 8160 

tcgatctcct ccggggtgag gtagctgatc 8220 

gccgggtaga agtacgacat cggcgtccag 82 8 0 

Sigggcggcgt tgtcggcggt gaagggcacg 83 4 0 

cggttccagc gcacgcccgt gcgtgcgcgg 8400 

tcgagggtgc gccgcatcgc cgcgccgtag 8460 

gcggtgccgc cggccgcgag cagcatcccg 852 0 

ttcaccgtga ccatgctctt caccttcgcc 8580 

gcggcgtccg acaccggctg gtcggcgatc 8640 

gtcttggaga acgatccgag gtggacgacg 87 00 

ggaagctgct gccccgggct gacgagccgg 87 6 0 

agctcgccgg cgaggtcgag cagggcgtgg 8 82 0 

gggttggtgt ggtcgggcac gacatagaag 8880 

gcgtggaccg cgcgggccag gtcctcggga 8940 

accgggtcga tgtcgagcag ccgcgcggcg 9 000 

acgaacagcg cgtcgcgctc gtcccggatc 9060 

tgcgcgccga ccgtcacgac gatggactcc 912 0 

agccactgcg cgatcacctc gcggatgcgg 9180 

tccctgatct ccgcgggcga gcggccctgg 9240 

ccgcggaaga tctgctcgac gtcgaagaac 93 00 

acggctcgcg gatagcgggc ggtcacctcg 93 60 

tcggacaggc tctggtgcag gtcctcgcgc 942 0 

cgcaggatgc tcatcatggt ctcctcggtc 9480 

ccgcaccgat caatcgaagc cacggcggtg 9540 

acggttggac accgtcggtg agctgcgttg 9600 

ccgggggctc gctcggccgc ctcgccggcg 966 0 

gtgaccgcca ccgccctcct gcccctgacc 9720 

cgaatggagc ccccggtgtg ggacttcatc 9780 



gccggcggcg cgggggagga gctgacgctg gccgcgaaca ccgccgcctt cgcaccgccg 9840 
cggctgcggc cacgggtgct gaccggcgcg ggcgcgccgg acacgggcac gacgatcctc 9900 
ggacggcggt gggcggcgcc gatcggcgtc gccccgctcg gctatcacac gctcgtcgac 9960 

ccggcgggcg aggtcgccac cgccgcggcg gccggcgcgg ccgggctgcc gctcgtggtg 10020 

agcacgttct ccgggcggac cgtggaggac atcgccgcgg ccaccaccgc gccgcgctgg 10080 

ttgcaggtct attgcttccg cgaccgggcg gtcaccgccg cgctcgtcac gagggccgtc 10140 

cgcgccggct tcgaggcgct ggtgctcacc gtcgacgcgc cgcggctggg ccgccgcctg 10200 

cgggacatcc gcaacgactt ccgcctgccg cccggcgtgg cgccggcaaa cctcaccggc 10260 

gacggcttcg cgtcgcccag cgggcacgcg ctcggcgcgt tcgacgccgc gatggactgg 10320 

accgtcgttg cctggctgcg ggagctcagc gggctgccgg tgctgctcaa gggcgtgctg 103 80 

accgccgacg gtgcccggcg ggcgctcgac gcgggtgcgg acgggatcgt cgtctccaac 10440 

cacggcggcc ggcagctcga cggcgtgccg gcgacgctcg acgtgctgcc cgaggtggtg 105 00 

gcggccgtgg ccgggcgctg cccggtcctg ctcgacggcg gcgtgcggcg cggccgcgac 10560 

gtcctgctgt cgctggccct cggcgccgac gcggtcctgg taggccgccc ggtgctgtac 10620 

ggcctcgcgg tcggcggcac ggccggcgtg cggcacgtgc tcgacatcct cgcgggggag 10680 

ctgaccgacg acatggccct ggcgggcgtg gcctcgcccg cggacgccgg cgcggacctg 10740 

gcgggcccgg tcgcgccgta gaggcggtcc atacgactgc ggcggccggg aatacccggc 10800 

cgccgcaacg tcgtcctcca gggacatctc aggacggcac ggccggctgg cgctcctcgg 10860 

tgcgctggcc ggcgaactgg gtccggtaca gctcggcgta caggccaccg tgcgcgatca 1092 0 

gctcgtcgtg ggtgccgcgc tcgacgacgc ggccgtcgtc gatgacgagg atctggtcgg 109 80 

cgtccaggat cgtggcgagc cggtgggcga tgacgagcga cgtacgcccg gcgagcgccg 1104 0 

tgtccagggc ccgctggatc gccgcctccg attcggagtc cagatgggcg gtggcctcgt 11100 

cgagcaccac caccgggggc gatttgagca gcaggcgggc cagggcgagg cgctgcttct 11160 

cgccgcccga gagccggtag ccgcgatcgc cgaccaccgt gtccagcccg tcgggcagcg 11220 

aggacaccat ctcccagatg cgggccgcct cgcaggccgc gacgaggtcg cgctcgccgg 11280 

cgtcgggacg gccgtagagc aggttggccc ggatggtgtc gtggaacagg tgcgcgtcct 11340 

gcgtgaccac gccgatggac tcgctcagcg agcgcagggt gaggtcgcgg acgtcgtggc 114 00 

cggcgatccg gaccgtgccc gaggtggtgt cgtagaggcg cggcaccagg tgggtgatcg 114 6 0 

tggtcttgcc ggcgcccgac gggccgacca gggcggtgag cttgccggcc ggggcgagaa 1152 0 

agctgatccc gttgagcacc ggccgctccg cggtgccgtc ggaggaccgc tgggccacgg 11580 



tctccaacga ggccagcgag acctcgtccg cgcccggata gcggaacacg acgttgtcga 11640 

actcgatgtc cggcgccgcc gagcggcccg gctcggcggc cggcagggcg cgggcgcccg 11700 

ggcgctcctt gacgagcggg tcgaggtcga gcacctcgaa gacccggtcg aagctcacca 11760 

gcgcggtgac gacgtccacc tggatgttgg tgagctggtt caccgggccg tacagctgcg 1182 0 

ccagcagggc gaccatggcg acaagggtgc cgatgccgag cgtgccgtcg atgaccaggg 11880 

cgccgccgaa gccgtagacc atggccgtgg tcaccgtggt cagcagcgtg gcgatgatga 11940 

acagcagccg tgcgtgcacg cccatcgaga tggcgatgtc acgcacccgc gcggcccgcc 12000 

cggcgaaggc ggtctcctcg ctctccggcc ggccgtagag cttgaccagc atggcgccgg 12060 

agacgttgaa ccgctcgttc atcatcgagc ccagctccgc gtcgacctgc atgccgccgc 12120 

gggccagccg ctccagccgg cccgcgatga gcttgccggg caggaagaac agcgggatga 12180 

gcaccagcgc caccagcgcg atcgcccagg agaggtagaa catcgcgccg atgaccagca 12240 

cgacggtcag gaccgtggag accgtctgcg tgatcatcga ggtcatggcc tgctcggcgc 123 00 

cgaccacgtc ggtgttgagc cggctcacca gcgaccccgt ctgcgcccgg gtgaagaacg 12360 

ccagcggctg gcgcatgacg tgggcgaaca ccttggtgcg > caggtcgtag atgagaccct 12420 

gcccgacgcg cccggaggcc agcgtctgca cgtggatcgc cgccacgttg accagggcga 12480 

ggccggccac caccagcgac agcccgacca ccacgtccag gcggccggcc acgatgccgc 12540 

ggtcgatgat ctgcttgagc agcagcgggt tggcgaccgt gagcgcggcg tcgaccacgg 12 600 

tcatcagcag ggcgacggcc agggaccagc ggtgtttctt cgcgtacggg aggatgcgcc 12660 

gcaccgtgcc gggcctgacc ttctgcatcg gtatgaggcc gtcgacgcgc agcccgatga 1272 0 

cgcccagatc cgggccgtcg atggttgcca cggaaccttc tctctcgcgg tagcgggtgc 12780 

gccggtcgtg ccggcgcgtc cctagtggga gtcgaggaac tcgaggatcg cggcgttcac 12840 

cgcgtccggc cgttccagat agccgaggtg cccgcagcgg gatatctcga ccaggtcgca 12900 

gtcggggatc gcctcggcca cctcggcggc caggtgcggc ggcgtgatga ggtcgtcggc 12960 

gaacgagatc acccggcagg gcgcggtgac cgagcgcagg gcggcgcgcc ggtcaccggt 13020 

cagctcggcc caggcctgcc cgccggccga gacgcccgtg ccggagagct cgaagatgtc 13 080 

gagccactcg cgcacggcca cgtcgtcgtt gagcgtcgcc ggggagaaca tcttgaagac 13140 

cgcggtcgcg gcctcgtacg cggcgggcag cgtgaccccg ctctccagca ggtcgatgtc 13 2 00 

ggcctggttc tgcgccgcgc gggccgcgtc gggccgggcc agggtcgcga tcaggaccgc 132 60 

gcagcgcacc agctccggat ggtcgacggc cagctcctgg gcgatcatcg cgcccagcga 13320 

ggtgccgacg atccggcagg gcgcgagatc gagggcctcg atgagaccgc gggtgtcggc 13380 

ggtcatgtcg gcgagggagt acttgccggc gggcacgtcc gagggcggga tgccacggct 13440 



gtcgaagacg 
ggtctggccg 
gcgatacgcc 
cgccactcgc 
aggaactcgt 
tggccctgcc 
gtcatgatcg 
ttcttgttga 
cactccatgc 
agcaccgggt 
acgaagtacg 
gcggtgaccg 
aaaccgatcc 
cgcggcacga 
tggtcctggt 
gtgtaccggt 
tgctcggtct 
atgtgccggt 
ccggcgtcgt 
gcgtcgaaga 
gcccggggaa 
agctgcaccg 
aagtgcagcg 
gccggcagcc 
tcctcggcct 
gtcttgagca 
ccgcgcatct 
tcgagcagag 
tgcagccacg 
ctcggctgat 



acggtggaat 
gccgagcccg 
aggcggaccc 
gcttgttgaa 
cgatctgctt 
aggactcctc 
cgccggcctg 
ccggcttggt 
cgaggaacgc 
cgatgcccga 
tcgacttccc 
tgccgcccgg 
ggcgcaggta 
caacggtgcc 
gcccgtgcgt 
cgccgtggcg 
gcatcaccac 
cgggcgacag 
cgagctccag 
gctcggtcag 
gccgcggtgt 
actgccgccg 
agggctggtt 
gggggtagag 
cggccaccgc 
gctcggcgac 
tcgggttcgt 
gcacctgcag 
cgtagaaacg 
cggcgttgcc 



atcccgcctc 
atcccatgat 
cgttggtggt 
caggtgctcg 
gagctggtag 
gcccatggcg 
ctccgcgttc 
cagcagtccc 
catgtcgacc 
cgagtcggcg 
gcggatgtcc 
gaacggcacc 
cagcgcgatc 
gatccggccg 
gatgagcacc 
gttgtcggtg 
gcaggcgtga 
gctcggctcg 
cgcctcgcgc 
cccgggatgc 
gctgaggatg 
ctcgtcgtag 
gctggtgtcg 
cggcgtcagg 
ggcggcgaag 
ctcgtcgctg 
gctcgccgcc 
gaacgccagg 
gtcgacgagc 
catgtgaacc 



gtgcaacgcg 
catgagaacg 
gacgaaaccg 
gcggtgaagc 
gtgtcctcgt 
tagatgtatc 
gatccggaca 
ttgtagagcc 
tggccgacgt 
ccgatgaaga 
agatcggcgt 
tcgtcgaact 
gacgggtccg 
cgcagctgca 
aggtcgatgt 
ctgatgaacg 
ccgtagtagc 
tcggtgagca 
aggcggttga 
cgcaggggca 
aacgggcgct 
gccgggctgc 
tacgcgatct 
tcgtagccga 
ccgagcatgt 
cgggattcct 
acgtgcaccg 
ttcatggagg 
ggttcggcga 
tctccatcct 



ggcgtctggt 
ggctcgccgg 
gagcccgcgg 
cccggtccgc 
tgtaggtcgt 
cggcggtggc 
gccgccgtga 
agttgagcgg 
ggtcgcggat 
tggtgcgccc 
gctcgcccag 
cctcgacctc 
ggaggttgcc 
gcagggtctc 
ggtcgggcag 
ggtcggtgac 
ggacccggcc 
gggcatccag 
gcgaggtggg 
gcggaaggtc 
ccaccccgtc 
ggtagagcag 
ccaccagccc 
ccgcgttctc 
cggcccgctc 
cgacggcgac 
gcggcgactg 
ccggcaccgg 
tgacgttgga 
gatggcgggt 



gcaccgtcca 
ctccggagcg 
cgctcaccag 
gcaccacgcg 
cgccatgacg 
gccgagctcg 
ggcgctcatc 
cgcgccgtcg 
gtagcggtag 
ggccatccgc 
gaacggcgtg 
gacgacggtg 
gcggctggtg 
gagcacgatg 
gtcgtcgagc 
caccgccgcc 
gccggactcg 
cgcggccgcg 
ccgcacgcgg 
gagcacgcca 
gtcgagcgag 
cggctcgagg 
gttgagcgcg 
gcggatcagg 
gttcttgatc 
gaagtagccg 
caggtacgac 
tgccaccgtg 
gcgcaggtat 
gcgcccggag 



13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 



cgggcacgat cgggcgggag ggaatgcggc ctcggtacgg ccggcgcggc aaacggcgcg 153 00 

tccgtccctc cggcgtgggc gcggtccact atggcatcgg cccaggagcc ggtacagcgt 153 60 

gcacgtgcgc ggcccgatcg cgtcgaacgc cctgtacaac cggcggtgcg acgcagtcga 15420 

acagcggtga acgaccgagg cggcggcggc ccgttgatcg gccgcccgag cgcccgcata 15480 

gtgaatcgcc ggaggggcct tggagagccg gcccgccgtt cccggccatc gccgggcgcc 15540 

cgccctcaca aaccgatcgc ccggcgccgc cgggtcgcac ggaggtgccc atgtccgaga 15600 

ccgacctgtc cgccgcccgg cacacgcccg agcagatccg ctcctggctg atcgaccgga 15660 

tcgcctacta cgtgatgctg ccgacccagg agatcgagcc ggacgtgtcc ctggccgagt 15720 

acggcctgga ctcggtgtac gcgttcgcgc tctgcggcga gatcgaggac acgctcggca 15780 

tcccgatcga gccgaccctg ctgtgggacg tcgacaccgt cgccaccctc accgcccacc 15840 

tcgccgaccg cgtcaaccga taagccggag gtgaccgtcg tgcccacccc tgacctgcgc 15900 

ccgctcacgc ccgcccagct cgccgtctgg cacgcgcagc agctcgcccc gcacagcccc 15960 

gtctatcagg tcggcgagtt cgtcgagatc gacggcgagt gcgaccccga tctcctggtg 16020 

gcggcgttgc gtcaggtcat gggcgaggcc gagagcgccc ggctgcggtt ccgcgtgatc 16080 

gacggtacgc cgtggcagta cgtcgccgag gacggcgacg acccgatcca ggtcgtggac 16140 

ctcggcgcgg ccgcggaccc gcgcgccgcg gcgctgggcc gcatggcggc cgacctcgac 162 00 

cggcccggcg acctgcgcga cggcccgctc gtcgagcacc acgtctacct gctcggcgag 16260 

ggccgggtca tctggtacca ccgcgcgcac cacatcgtct gcgacggcgg cagcctcggc 16320 

attgtcgcct cccgggtggc cggcgtctat tccgcgctcg cggccggtgg tgacgtccgg 16380 

ccgggtgcgc tgccgccgct gtcggtgttg ctgtcggccg ccgacgccta cgagcgctcc 16440 

ggcgaccgcg accgggaccg cgagcactgg cgctccgcgc tggcgggcct gcccgccgag 16500 

ctgctcgcgg gcgcgggccg gccgcggccg ctgcccggac cgccggtgcg ccacgagcac 165 60 

gacctctccg cggcggaggc gggccggctg cgcgcggggg cgcggcggct gcggaccagc 16620 

gtggcgcagg ccggcatcgc ggccgcggcc ctctaccagc accggctcac cggcgcccgg 16680 

gacgtgctgg tggcggtgcc cgtcgccggc cgcaccaccc gcccggagtt cgacgtgccc 16740 

ggcatgacgt cgaacgtggt gccggtgcgc ctcgcggtca cgcccgccac gaccgtcggc 16800 

gagctgctgc gcgacgtcgc ccgtggtgtc cgcgacggcc tgcggcacca gcggtacccg 16860 

tacccgaaca tcgtggacga cctcggcctg gccgaccgtg ccgcgctgcg cccggtgacc 16920 

gtcaacgccc tggcgctggg acggccgctg cgcttcggct cggcggtggg tgtgcgctcc 16980 

ggcctgtcgg cgggcccggt ggacgacgtc accatcggcc tctacgaaaa ggtcagcggc 17040 

ggcggcatgc agacgatcgc cgagctgaac cccgggcgca cggaccgccc ggacgcggcg 17100 



gaggtctccc 


gctggttccg 


tacgctgctg 


cgcgggctgg 


ccgagagcga 


cgccggcgac 


17160 


ccggtggccc 


gcatcgacat 


cgtcgacgag 


cccgagcgcc 


gccggctgct 


ggacgagtgg 


17220 


aacgccaccg 


cggcgccgtc 


gagcgacacc 


gtcctcgcgc 


gtttcgagga 


gcaggcggcg 


17280 


cgtacgcccg 


aggcgcccgc 


cgtcgtctgc 


ggcgacgtga 


cggtcaccta 


cgccgagctg 


17340 


gaggccggcg 


ccaaccggct 


cgcccgcgtc 


cttcgcgcgc 


gcggcgccgg 


accggagtcg 


17400 


gtcgtggccc 


tctgcctgcc 


ccgcggcccc 


gaggtcgtga 


ccggcatcct 


cgccgcctgg 


17460 


aaggcgggcg 


ccgcctacct 


gccggtcgac 


accgaactgc 


cggccgagcg 


cgtcgcctat 


17520 


ctgctcggcg 


acagcgccgc 


cgccgtccgc 


ctcggcaccg 


ccgagacgct 


cgccgccctc 


17580 


ccggacggcc 


ccgccgccga 


cgtcgacgtc 


cacgcaccgg 


agatcgcccg 


ggaatcgccc 


17640 


tcgcccctgc 


ggctcgagcc 


cttgccggat 


cagttggcgt 


atgtgattta 


tacgtcgggt 


17700 


tcgacggggt 


tgagcaaggg 


tgtcggtgtt 


tcgcatggtg 


ggttggcgaa 


ttatgtgggt 


17760 


tgggcgtcgg 


ttttgtatgg 


gggtttgtcg 


gcgccgttgc 


attcttcgtt 


ggcttttgat 


17820 


ttgacggtga 


cgagtgtttt 


tgtgccgttg 


gtgtgtggtg 


gttcggtggt 


ggtgtcggcg 


17880 


gccggtggtg 


gtcggggttt 


ggcgtcgttg 


ttggcggctg 


gtgatggttt 


ttcgttggtg 


17940 


aaggtggtgc 


cgggtcattt 


gcgtttgctg 


gcggagttgg 


tgccggcggg 


tgagatggcg 


18000 


gcggtgggtt 


cgttggtggc 


cggtggtgag 


gttttggccg 


gtggtgatgt 


gcgtgagtgg 


18060 


ttgtcgcggg 


tgccgggttc 


ggtggtggtg 


aatgagtacg 


ggccgacgga 


gaccgtggtg 


18120 


ggttgttcgg 


ttttctcggt 


ggccgcgggt 


gatgtggttg 


gtgatgtggt 


gccggttggc 


18180 


cggccggtgg 


cgaatacgcg 


tttgtttgtt 


ttggatgagg 


gtttgcggcc 


ggttccggct 


18240 


ggggtggcgg 


gtgagttgta 


tgtggctggt 


tcgcaggtgg 


cgcggggtta 


tgtgggtcgt 


18300 


tctggtttga 


cggcttcgcg 


ttttgtggcg 


tgtccgttcg 


gtgtgggtga 


gcggatgtat 


18360 


cgcacgggtg 


atgtggtgcg 


gttggccggt 


ggtgatctgg 


tgtttgtggg 


ccgggtcgat 


18420 


gagcaggtga 


agattcgtgg 


ttatcgggtg 


gagccggatg 


aggtgcggtt 


ggtggtggcg 


18480 


gggcatccgc 


gggtagcggg 


tgcggcggtg 


gtggctcggc 


cggatgcggt 


gggtgagcgg 


18540 


cagttggtgg 


cctatgtggt 


cgctgccggt 


gagccggccg 


ggttggcgga 


gtcggtacgc 


18600 


gcccacgtcg 


ccgagcgcct 


gcccgaatac 


atggtcccgg 


ccgccgtcgt 


gaccctcgac 


18660 


gagatcccgc 


tgaccgtgaa 


cggcaaggtc 


gaccgcgccg 


ccctgcccga 


gcccggcccc 


18720 


gtcgccaccg 


gcaacgccga 


ccgcgagccc 


acgaccgagc 


gcgaatcgct 


gctctgcggc 


18780 


gccttcgccg 


acgtgctcgg 


catcgagcgg 


gtcggcgtcg 


acgacgactt 


cttcagcctc 


18840 


ggcggccatt 


ccctgctcgc 


caccagcctg 


gtgagccggg 


tacgcctcgt 


gctcggtgag 


18900 



gaactgccca tcgaggagct cttcgccacg cccacccccg ccgagctggc ggcctggctg 18960 
caacgcaacg cggaccggcc gcaaccggcc cggccggcgc tgcgcccgat gcacgaaagg 19 02 0 
gaaacgaccg catgaccccg atgtcgtacg cccagcgccg tctctggttc cagctgcggg 19080 
tcgagggccc cgacgccacg tacaacagtc ccgccgtcct gcgcctcacc ggcgagctcg 19140 
acaccgccgc cctggagcac gcgctgcgcg acgtcctcga acggcacgag gtcctgcgca 19200 
cggtctatcc cgacgtcggc ggcgagccgc ggcagcgcgt ggttcgcccg gacgacatgg 19260 
tgtgggagct gcccacgacc cgggtgtccg gtgccggcgc gggcgacgac cggctcgtca 19320 
cgctcgacga gctgccctgg gaccgcccgg tgctcgacct gccgtcgccc gcaccggccg 19380 
gccgggaacc ggacggcgag atcaccgtcg acgagctgcc cggcgcgatc gcccgggtgg 19440 
cggcccaccc cttcgacctc tccatcgaga tcccggtgcg ggcgcggctg ttcgccctgg 19500 
gcccgcggca ccacgfccctc gtcgtcgtgc tccaccacat cgccaccgac ggcagctccg 19560 
gcgggccgtt cgcccgcgac ctcgccgccg cctaccgcgc ccgccgcacc ggaacggcgc 19620 
cccagtgggc accccttccg gtgcagtacg ccgactacgc ggcctggcag caggaactgc 19680 
tcggcgccga ggacgacccc gacagcgtca tctcgcggca gctcgcccac tggcaggagc 19740 
ggctg^ccgg catgccggtc gagctggatc tgccggccga ccgtccgcgg cccgccgaac 1980 0 
ccgggcacgg cgggcacacc aaggcgctga gcctgccccc ggccgtgcac cgaggactgg 19860 
ccacgctggc ccggcggcgc cgcgccacgc ttcaaatggt cgtgcagacc ggcgtcgcga 19920 
tcctgctgtc caagctcggc gccggccgcg acgtcccgct cggcatcccg gtcgccgggc 19980 
gcaccgacgc cgcgctcgac gacctgatcg gtttcttcgt caacaccttg gtcgtacgcg 20040 
ccgacctgtc cggcgacccc acggtggccg acgcgctggg ccgggtgcgc ggaggcgcgg 20100 
tggccgccct ggccgaccag gacgtgccct tcgacaaact cgtcgagcgg ctggcgccgg 2 0160 
cccgcgtgct cgggcggcac ccgctgttcc aggtcatggt cgcgccgctc gacgacggga 20220 
cgccgatcga cctggacggg gtgcgaggcg agccgctcac catcggccgc tccggtgcca 20280 
agttcgacgt cgaggtgatg accggagagg tacgcgcggc ggacggcgcg ccggccggca 20340 
tccgcggaat cctgacgctc tcggccgacc tgttcgacga ggcgacggcc ggccggatgg 2 0400 
ccgccgggct ggtgcgggtg ctgaccgcga tggccgaggc cccggagcgg cggctctccg 20460 
gcatcgaggt gctgtcgccg ggggagcggt cgcggctgct ggtggagtgg aacgacacgg 20520 
ctcgtccggt ggtggagtcg tcggtgccgg cgttgttcgc gaagcgggtg gcggccacgc 20580 
cggacgcgac ggcggtggtg ggcgagggtg tgtcgtggtc ctatcgcgaa ctcgatcgtc 20640 
gctcggatgt gctggcgcgg cgtctggtgg cggcgggtgt gggtgtggag tcgccggtgg 207 00 
tggtggcgtt' ggagcggtcg ccggaggtgc tgtccgcgtt tttggcggtg gcgaaggccg 20760 
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gtggtgtgtt tgttccggtg gatttgtcgt ggccgcaggc gcgtgtcgat gcggtggttg 20820 
ctgactgtgc cgcgcgggtg gcggtggctg accggccgat gagcgggctg acggtcgtgt 20880 
ccgccggcct gggcggggat tcggccgtcg tgtccgccga cctgaccgcg gatcgggccg 20940 
ttgtgttgcc gtctcgcccg gtgccgggtg cggcggtcta ccggatgtac acctccggct 21000 
cgacgggccg ccccaagggt gtggtgacca cccaccagaa cctggtggat ctggcgaccg 21060 
acacgtgttg gggtccgacc ccgcgggtgc tgttccacgc cccgcacgcc ttcgacgcgt 21120 
cgtcgtacga aatctgggtg cctctgttga atggcggcac ggtcgtggtc gccccgcagc 21180 
gcagcatcga cgccaccgtc ctgaaggatc tgatccgcgc gcatgatttg acgcacgtgc 21240 
acgtcaccgc gggcctgctg cgggtgctgg acccgtcgtg cttcgcgggc ctgaccgagg 21300 
bgctcacggg tggcgatgcg gtgtcggcgg aggcggtgcg ccgggtcaaa gacgcgaacc 213 60 
cgggtctgcg ggtgcggcag ctgtacggcc cgaccgaggt gacgctgtgc gcgacgcagc 2142 0 
atctgctgga tgacggggtg ccgatcgggc ggccgttgga caacacccgc gtgtatgtcc 21480 
tggacgactt gctgcagccg gtgccggtcg gtgtgaccgg cgagctgtat gtggccggtg 2154 0 
ccggcgtggc gcgtggctat gcgggcatgc ccgggttgac ggccgagcga tttgtcgccg 21600 
acccgttcaa caccggcggt cgcctctacc gcacgggtga tctggtgcgg tggaccgatg 21660 
acggtgtgct gcatttcgcc ggccgagccg atgatcaggt gaagatccgc ggctatcggg 21720 
tggagccggg cgaggttgaa gcggttctgg ctcaacatcc cgatgtcagc caggtcgcgg 21780 
tggtcgtccg ggaggacacc ccaggcgaca agcgtcttgt cgcgtatgtc gtcggcggtg 21840 
acatcgaggc gtatggccag gagcgccttc ccggttacat ggtcccgtcg gcattcgtac 21900 
atctggatgc gttgccgctg accagcaacc agaaggtgga tcgggccgca ctgcccgcgc 21960 
cgtccatgga gtccggcgcc ggtcgagctc ccgcggatgc ccgcgaggag ttggtgtgtg 22020 
ccgcgttcgc cgaggtgctc ggcctggatc gggtcggtgt cgacgacgac ttcttcgctc 22080 
tcggcggtca cfccgctgctt gccgtctccc ttgtggagga tttgcggcag cgtggcctgc 22140 
acgtctccgt tcgcgcgctc ttcgccacgc cgacccccgc ggcgctggcc gtctcgaccg 22200 
tggcggcccc gatcgaggtg ccgcccaacc tcatccccca gggcggcgcc cgggaactga 22260 
cccccgacat gctgcccctg gtcgacctga ccggcgagga gctggccacc atcgtggccg 22320 
cggtgcccgg cggcgctgcc aacatcgccg acatctaccc cctagccccg ctgcaagagg 22380 
gcatcttctt ccaccacctc atgaccgagg gcgacaccgc cgacgtctac gcgctgccgt 22440 
acctgctgcg cgttggcacg cgtgagcagc tcgacgcctt cctcggggct ttgcagcagg 22500 
tggtggaccg ccacgacgtc taccgcacgg ccatcgcctg gcagaacctg cgcgagcccg 22560 
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cgctcttcca cctcgcgtgg gcgcgggtgc tcggcacgct ggccgggcgc gacgacgtcg 2 6280 
tcttcggcac cgtcctgttc ggacggatga actcgggtgc cggcgccgac cgggtctccg 2 63 40 
gcctgttcat caacacgctg ccggtgcggg tgcggctcgg cgcgcccacc ggcgacgccc 26400 
tcggcgacct ccgcgaccag ctcgccgagc tgctcgtgca cgagcacgcc tccctcgcct 26460 
ccgcgcagaa ggcgagcggc ctgcccggcg ggagccccct gttcacgtcg atcttcaact 2 652 0 
accggcacaa ccaggtgagc gccgaacggg agaccgccgc gctgcccggc atccgcgtcc 26580 
tcgcggcccg cgactccacg aactatccgc tcaccgtcgc ggtggacgac gacgggcacg 26640 
gcttcacgct cgtggtcgag gtcgcgtcca cagtcgacgc cgcgggcgtc tgcgaactgc 26700 
tgcacaccgc cgtggacaac ctcatcgcgg ccctcaccga ccggccgggc gggccgctgg 2 6760 
ccgaggtcga catcctcgaa cgcggtctgc gggaccgcct gctgaccgcc tggaacgagg 26820 
cccgggagcc cgcaccgccg gtgaccctgc ccgacctgtt cgaccggcag gcccgccgca 26880 
cgcccgaggc ggtcgcgctc accgcggacg gcgtctcgct gacctaccgc gagctgtcgg 26940 
agcgcgccaa ccggatcgcc cggctgctca cctcccgggg gatcggcccg gagtcgctcg 27000 
tcggtgtcgt cctgccccgc tcggccgacc tggtcgttgc gctgctcggc gttctccagg 27060 
ccggcgcggc ctacgtgccg gtcgacgccg actacccggc cgagcggatc gggtacatcc 27120 
tcggcgacgc gggcgcggtt tgcgtgctca cagtggacgc gaccgcgggc gccgttcctc 27180 
ccggcgtacc gaaactggtg ctggaccacc cggaaaccgt gaccgcgctg gcggcgtgtg 27240 
acacggcacc gctcggcgag gccgagcggg ccggcgaact gctgccggag caccccgcct 273 00 
acgtcatcta cacctccggc tccaccggca cgcccaaggg cgtgctcatc ccgcaccgca 27360 
acgtggtcga gctcttcgcc gccacccgcg gctcgttcca cttcggcgag ggtgacgtgt 27420 
ggtcgtggtt ccactcggtc gccttcgact tctcggtctg ggagctgtgg ggcgcgctgc 27480 
tgcacggcgg ccgcgtggtc atggtgccgt tcgccgtctc ccgctcaccg cgggatttct 27540 
gggaactgct cgtgcgcgag cgggtcaccg tgctcagcca gacgccgtcc gccttctacc 27600 
agctcgccgc ggccgccgac gacacgccgg acgcgctgcg cgtggtggtc ttcggcggcg 27660 
aggccctcga cccgggacgg ctcgccggat ggcgggaacg gcgcccggac ggaccgcgcc 27720 
tggtcaacat gtacggcatc accgagacga cggtccacgt cacccaccag gacctcgcac 27780 
cggccgacac caccggcagc cccatcggac gcggcatacc gggcctgtcc gtctacgtcc 27840 
tcgacgaggc gctgcggccg gtgccgcccg gggtcgccgg cgaggtgtac gtggccggtc 27900 
gccagctggc ccgggcctac ctcggccgcg ccgcgctgac cggcacccgc ttcgtcgcct 27960 
gccccttcct cccggccggg gaacggatgt accgcaccgg cgaccgtgcc cgctggagcc 2802 0 
gcggccggtt gcagttcgcc gggcgcaccg acgaccaggt gcagatccgc ggcttccgca 28080 
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gggcgcgggt gcgtccggcg cgggtgccgt tgtcgttcgc gcaacgccgg ttgtggttcc 31800 
tgtcccagct ggagggtccg agcgcgacct acaacatccc ggtggcgctg cggctggacg 31860 
gtcctctgga tcgcgatgct ctaacggcgg cgttgcacga tgtggtggcc cggcacgagg 31920 
tgttgcgtac cgtcttcacc gtcgccgacg gcgagccgtg gcaacagatc ctcgacgatc 31980 
cgcaggtctc cgttccggtc gtcgaggtca cgcccgaccg gctgcccgag gcggtggccg 32040 
tcgccgcggg gcaccggttc gacctcggcc gggaactgcc gctgcgggcg gtcctgctgg 32100 
cgaccggcga cgacgtgcac gtgctggtgc tcgtggtgca tcacattgcc gccgacggct 32160 
ggtcgatgcg gccgctcgcc cgggacttgg cggccgccta cgcggccagg atcgacgcga 32220 
cggcgccggc cctcggcgcg ctgccggtgc agtacgccga ctacgccctc tggcagcgcg 32280 
acgtgctcgg ttccgagcac gacccggaca gcgtcatctc ccaacaggtt gcctattggc 32340 
ggcggcagct ggccggcgta ccggaggaat tggatctgcc ggtggaccgg gcgcgcccgg 32400 
ccgaggcatc gcatcgcggc cacaccgtgg agttcgccgt gcccccggcc gtgcaccacc 32460 
aactcgccga actcgcccgc cgcaacggcg tcaccgtctt catgaccgtg caaaccgccc 32520 
tcgccgtcct cctgtccaaa ctcggcgccg gcaccgacat ccccatcggc gtcgccgtcg 32580 
ccggacgcac cgaccccacc ctcgacaacc tcatcggctt cttcgtcaac accctcgtcc 32640 
tacgcaccga cctgaccggc aaccccacca tcaccgacct gctgcaccgc acccgcgaca 32700 
ccaccctgca cgccttcacc caccaagacg tccccttcga aaaactcgtc gaagacctcg 32760 
cacccacccg ctccctcgcc cgccaccccc tcttccaggt catgatgacc ctgcagagcg 32820 
cgtcggcgga cgaagagccg ctggcgctcg ccgggctgcg ggtcaccgac ctcccggccg 32880 
gggagacacc cgccaaggtg gacctcgacc tgacgctgca cgaggtggcg ggccgagacg 32940 
gcatgcacgc cacgctcctc ggcgcggccg acctcttcga gcaggagacg gtgcgcgccc 33000 
tcgccgaccg gttgctgcga accctggaag ccatggcggc ggccccggac gaccgcctcg 33060 
accggatcga ggtgctgtcg ccgggggagc ggtcgcggct gctggtggag tggaacgaca 3312 0 
cggctcgtcc ggtggtggag tcgtcggtgc cggcgttgtt cgccgagcag gtggcggctg 33180 
cgccggacgc ggtcgcggtg gtgggcgagg gtgtgtcgtg gacctatcgc gagcttgatg 33240 
cgcggtcgga tgcgttggcg cggagcctgg tggcggcggg tgtgggtgtg gagtcgccgg 333 00 
tggtggtggc gttggagcgg tctccggagg tgctgtccgc gtttttggcg gtggcgaagg 33360 
ccgggggcgt gtttgttccg gtggatttgt cgtggccgca ggcgcgtgtc gatgcggtgg 33420 
ttgcggactg cggcgcgcgg attgcggtgg ctgaccggcc gatgagcggg ctgacggtcg 33480 
tgtccgccgg cctgggcggg gattcggccg tcgtgtccgg cgacctgacc gcggatcggg 33540 



ccgttgtgtt gccggccggt ccggtgccgg gtgcggcggt ctaccgcatg tacacctcgg 33600 
gttccacggg tcggcccaag ggtgtggtga ccacccacca gaacctcgtg gatttggcga 33660 
ccgacacgtg ttggggtccg accccgcggg tgcttttcca cgccccgcac gccttcgacg 33720 
cgtcgtcgta tgagatttgg gtgccgttgt tgaatggcgg cacggtcgtg gtggctccgc 33780 
ggcgcagcat cgacgcgacg gtcttgaggg acctgatcgg cgcgcatgag ttgacgcacg 33840 
tgcatgtgac cgcgggcttg ttgcgggtgc tggacccgtc gtgcttcgcg ggcctgaccg 33900 
aggtgctcac gggcggggat gcggtgtcgg ccgaggcggt gcgccgcgtc aaagacgcga 3 3960 
atccgggtct gcgggtgcgg cagctgtacg gcccgaccga ggtgaccttg tgcgccacgc 34020 
agcatctgct ggatgacggg gtgccgatcg ggcggccgtt ggacaacacc cgcgtctacg 34080 
ttctcgacga cctcctgcgg ccggtcccga caggtgtggt gggggagctg tatgtggccg 34140 
ggtcgggtct ggcgcggggc tatgcgggca tgcccgggtt gacggccgag cgattcgtcg 34200 
ccgacccatt caacaccggc ggtcgcctct accgcacggg tgatctggtg cggtgggccg 34260 
acgatggtgt gctgcatttc gctgggcggg ccgatgatca ggtgaagatt cgtggctatc 3432 0 
gggtggagcc gggcgaggtt gaagcggtcc tggctcaaca ccccgacgtc agccaggtag 34380 
cagtggtcgt ccgcgaggac accccaggcg acaagcgcct ggtcgcctac gtcgtcggcg 34440 
gggatgtcga ggcgtatgcg caggagcgcc ttccgggcta catggttccg tcggctttcg 34500 
tgcagttgga tgcgctgccg ctgaccagca accagaaggt cgaccgggcc gctctcccgg 34560 
cgccatccat ggagtctggc gccggccggg cacccgccga cgcgcgtgaa gagctgatgt 34620 
gtgccgcgtt cgccgaggtg ctcgacctgg atcgggtcgg tgtcgacgac gacttcttcg 34680 
ccctcggcgg gcattcgctg ctcgccgtct ccctcgtgga gaatctgcgc cgccacggcg 34740 
ttcacatctc cgttcgggcc ctcttcgcca cccccacgcc ggccgcgctg gccgcctcgg 34800 
cgggaaccgc cgtcccggac gtgccgccca acctcatccc ccagggcggc gcccaggaac 34860 
tgacccccga catgctgccc ctggtcgacc tgaccggcga ggaactggcc accatcgtgg 34920 
ccgcggtgcc cggcggcgct cccaacatcg ccgacatcta ccccctagcc ccgctgcaag 34980 
agggcatctt cttccaccac ctcatgaccg agggcgatgc caccgacgtc tacctcctgc 35040 
cgcggattct cggcttcggc ggccgtcccg agctggacgc cttcctcggg gccctgcagc 35100 
aggtggtgga ccgccacgac gtctatcgca cggccatcgc ctggcagaac ctgcgcgagc 35160 
ccgtccaggt cgtgcaccgc cacgccaccc tgcccgtcac cgaagtcacc cccgaccagc 3522 0 
tgcacgccgc cgccaccggc ggccggctcc cgctcgacca cgcacccctg ctcagcgtcc 35280 
acatcgcacc cgaacccgac ggcggctggc tcgccctgct ccgcatgcac cacctcgtgc 353 40 
aggaccacac cgccctcgac atcgtcctcg acgagatccg caccatcctc gccggcgcaa 35400 
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cctcgcaccg cggcggcgag gtgccgatcg ccatacccgc cgacctgcac cgccgcctgg 37260 
ccgagctggc cgtcgccgag cgggccaccc tcttcatggt tctgcaggcc ggcttcgcgg 37320 
cgctgctgag ccggctgggt gccggcaccg acgtcccgat cggcaccgcc ctcgccggcc 373 80 
gcaccgacga cgccctcgac gagctggtcg gcttcttcgt caacatgctt gtgctgcgca 37440 
ccgacgtgtc cggcgacccc gggttcggca cgctgctgcg ccgggtgcgc gagaccggcc 37500 
tcgccgcgta cgcccatcag gacgttccct tcgaccaggt cgtcgaggag ctggtgaccg 37560 
agcgttccct ggcccggcat ccgctgttcc aggtggcgct gaccgtgcag aacgcgccgg 37620 
gcgcgcggcc gcggctggcc ggcctcgaag tcggcaccga gccgatcgag cacggcatcg 37680 
cccgctacga cctcacgctc accgtgaccg agcggcgcga cgagcacggc gcgccggacg 37740 
gcctggaggg gcatctcgag ttcagccgcg acctgttcga cgcgccgacc gtcgcgaccc 37800 
tcggcgaccg gctgatccgg ctcctcaccg ccgcggtggc cgacccggag ctgccgctga 37860 
gccggatcga tctgatggcc cccgcggagc gccgcaacgt gctcgagggc tggagcaccg 37920 
cccggcgcga cgtgccggcg gccaccgtgc cggagctggt ggccgcgcag gtggcccggc 379 80 
gcccgggtgc ggtcgcgctg cggtcggagg acggcgagat cacgtacgcc gagctggacg 3 804 0 
cgcgcgccgg gcggctggcc gcggtgctgc gccgccgcgg gatcgggccg gagtcgcggg 38100 
tggccgtgct gttgccgcgc ggcgtcgagc aggtggtcgc cttcctcgcc gtggtgcggg 38160 
ccggcggcac ctatctgccg atcgatcccg cgtacccgcg cgaccgcgtc gactacctcg 38220 
tccgggacgc ggagccggcg tgcctgctga ccgtcgccgg gcatcgcgcg gcggcgccgg 3 8280 
cggcaccggc ggtggtcgag ctggacgacc cggcgaccgc ggccgagatc gccgatgccg 3 8340 
aaccggagcc gccggtcgcg gtgcggccca cccactccgc ctacctgatc tacacctcgg 38400 
gctcgaccgg gcggcccaag ggcgtcgtgg tcacccaccg gggtgtggcc gcgctcgtgg 3 8460 
ccacccaggc cgagcggctc gcggtgaccg gcgagagccg cgtcctgcag ttcgccagcg 38520 
tgggcttcga cgcctcgatc tgggagatgg tgatggccct gtgcgccggc gccaccctgg 3 85 80 
tcgtcgcccc ggccgacgac ctgctgcccg gcccggccct ggccgccacg ctgtccgggc 3 8640 
acgcggtgac ccacgcgacc ctgccgccgg ccgtgctcgc cgcgtccgcg cccggcgatc 38700 
tcgcgccgct cgccgtgctg gtctcggccg gtgaggcgct cgggccggac ctcgtccggc 38760 
agttcgcgcc cggccgcgcg ctggtgaacg cgtacgggcc gaccgagacc acggtgtgcg 3 882 0 
ccacggcgtc cgccccgctc ggcccggagg atcccccgca catcggcgcc ccggtcgccg 38880 
attcccgggt ctacgtgctc gacgacgcgc tcaccccggt gccgcccggg gtcaccggcg 38940 
agctgtacgt ctcgggggcg tcgctggccc gcggatatgc cggtcgcgcg gccctcaccg 39000 
cggaacgctt cgtggcctgc ccgttcgcgc cgggtgagcg gatgtaccgg accggggacc 39060 



gggcccgctg ggacgcggcc ggccggctca cgttcgccgg gcgcgccgac gaccaggtca 3912 0 
agatccgcgg gttccgggtg gaaccgggtg aggtggccgc ggtgctcggc gaacacccgg 39180 
cggtcgcccg ggcggcggtc gtggcgcgca cggacgggcc gcagggcgcg cggctggtgg 39240 
cgtacctcgt cgccgccgac ccggccgggc ccgacctggc cgctgcggtg cgcgcgtacg 39300 
ccgccgcgac cctgcccgcg cacctgcttc cggccgcgtt cgtgccgctc gaccgcctgc 39360 
ccctgaccac gaacggcaag ctggaccggg ccgcgctgcc cgaaccggag accggcgccg 3 942 0 
ggcgcgagcc gtccggcccc gtcgaacggc tgctctgcga ggcgttcgcc gacgtgctcg 39480 
gcctcgaccg ggtcggcgcc gacggccact tcttcgacct gggcggccat tcgctgctcg 39540 
ccacccggct gctcagccgg ctccgctcgg ccgcgggcat cgacgtcccg gtccgggtgc 39600 
ttttcgagaa ccccactccc gccgggctcg ccgcctgggt ggagacccac gccggatccc 39660 
gccggaagtc ccggccggcg ctgcggccga tgcgtcacca gaaggagtcc tgatgatccc 39720 
cctgtcgttc gcgcagcgcc gcctgtggtt cctcggccgg ctcgaggggc cctccgccac 39780 
ctacaacatc ccgctcgtgc tgggcctgac cggcaccgtc gacgccgccg ccctcgaaac 39840 
cgccctgcgc gacgtgctgg agcggcacga ggtgctgcgt accgtctatc cggacgccgg 39900 
cggcgagccg caccagcgga tcctgccgct cggcgagacc ggcttcggcc tgcgggtcgc 3 9960 
cgaggtgacg gacggcgagc tggacgcggc cgtcgcggac gccaccgggc acgccttcga 40020 
cctcgcgacc gagatcccgg tccgggcctc gctgctcacc gtcgagccgg gccggcacgt 40080 
cctggcgctg gtgctgcacc acatcgcggc cgacggctgg tcgatggggc cgctgctgcg 40140 
cgacctgtcc accgcgtaca cggcccggct ggccggcggg gaaccggcct ggtcgccgct 40200 
gccggtccag tacgcggact acgcgctgtg gcagcaggag gtgctcggcg ccggtgacga 402 6 0 
cccggagagc ctcctgcgcg agcaggtcgg ctattggcgg tcggcgctcg ccggagcccc 40320 
cgaggagctg cgcctgccgg ccgaccaccg gcgcccgccc gtgtcgtcgt cccgggcgca 40380 
catggccgag ttcgccgtgc cggccgccgc ccacggcgac ctgaccgccc tcacccgcga 40440 
gctcggcgcc acgctcttca tggccgtgca cgcggcgacc gccatggtgc tttccgggct 40500 
gggcgcgggc gacgacctgc cgatcggcac ggtggtggcc ggccgcaccg acgccggcct 405 60 
cgacgacctc gtcggctgct tcgtcaacaa cctggtgatc cgggccgacc tgaccggcga 40620 
cccgaccttc gcggacctgc tgcggcaggt ccgcgagcgg gccctcgacg cgtacggcca 40680 
ccaggacgtg ccgttcgaga agctcgtcga ggagctcgcg ccgtcgcggt cgctgagccg 40740 
ccacccgctg ttccaggtgg ccgtcgccgt ggagaccgac gacctgatcg gcggtcgcgg 40800 
cggcggtccc gccctgcggc tgcccggcct cggcatcgag gtgctgcccg gcgagccctc 40860 
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cgctcgcgac ctcgacctcg acctggtggt gcgcgagacg ttcgacgccg agggacgccc 40920 
cgccggtctc accggggcac tgatcggcgc ggccggcctg ttcgacgccg cgtcggtgga 40980 
gcggctggcc gcgctgctgg cccgcgcgct cgaggcgctg gccgccgacc cgcgcacgcg 41040 
cgccggcgac ctcgacctgc tctccccggc ggaccgccgg ctgatcctgc gcggctggaa 41100 
cgacaccgcg gctccggcgc cggccggact ggtgccggac ctgttcgccg cccaggccgc 41160 
gcgcaccccg gacgcggtcg cggtcgccgg gcccgaccgg gagctgacct atgccgagct 41220 
ggacgagcgc tccggccgcc tcgcgcgctg gctgatccgg cgcggggtcg ccgccgacac 41280 
ccgggtcgcg ctggtgctgg agcgctccgc ggagctgccg gtggcgatcc tcgccgtgct 41340 
caaggccggc ggcgcgtatc tgccgatcga tccggcgcag ccgccgcgcc gcatcgccga 41400 
catcgtggcc gacgccgccc cggcgctcgt gctggcccag gcgtccaccg ccgacgtcgt 41460 
ggccgacgcc tctccggcgc tcgtgctggc cccggcgtcc gacggtgtgc ccaccggcgc 41520 
cgtgcccgtg cacctgctcg actcgcccgc cgtgcgtgac gaggtcgcgc agtgcccggc 41580 
cggggccgtg accgacgccg accggcgggg cgtcctgctc ggcggtcacg cggcctacgt 41640 
catctacacc tcgggatcga ccggacgccc caagggtgtc gtcgtttcgc acgacgcgtt 41700 
cgcgaacctc gtcctggacc agcgccggct cggcatcggg ccgggcagcc gggtggcgca 41760 
gttcgcctcg ccgggcttcg acatgttcgt cgacgagtgg tcgatggcgc tgctcgccgg 41820 
cgccgccctc gtgatcgtgc cgccggagcg ccggctcggc gcggacctcg ccgcgttcct 41880 
caccgagcgc ggggtcaccc acgccacgct gccgccggcg gtggtggcga cgctgccgga 41940 
ggagtcgctg ccacgctcgt tcgtgctgga catcggcggc gacgcgctgc cggacgacct 42000 
ggcccgccgg tggctgcgcg acggccggtg gctgggcaac tcgtacggcc cgacggagac 42060 
cacggtcaac gcggcgacgt ggcgctgcga gcccggcacc tgggagggcg cgaccccgat 42120 
cggccggccg gtcgccaacc tgcgggcgta cgtgctcgac ggccgcctgc ggccggtgcc 42180 
ggtgggcgtg gagggcgagc tgtacgtctc gggcgccggc ctcgcccgcg gctatctgaa 42240 
ccgggccggc ctgaccgccg gcagcttcgt ggcctgcccg ttcgagccgg gggagcggat 42300 
gtaccgcacc ggcgacatcg tgcgctggga cgcgcggggc cgcctcgtct acgccggccg 42360 
cgccgacgac caggcgaaga tccgcggttt ccgggtcgag ccgggcgagg tggaggccgt 42420 
gctggccgcc ggtccgggcg tgaaccaggt cgcggtgatc gtgcgtgagg acgtgccggg 42480 
tgacaagcgc ttggtggcgt acgtcgtggg cggcgacgtg gagaccctcc ggtcgtacgc 42540 
gcagcagcgg ttgcccggat acctcgtgcc gtccgcgatc gtcgcgctgg ccgagctgcc 42600 
gctgacaccg agcgccaagg tggaccggcg ggctctgccg gtgccggact acggccggga 42660 
cgccggtggt gggcgggcgc cggccaacgc tcgcgaggaa gtgttgtgcc gggcgttcgc 4272 0 



cgaggtgctc ggcgtcgagc gggtgggtgt ggaggacgat ttcttcgcgc tgggtggtca 42780 

ttcgctgctg gtggtctcgc tggtggagcg gctgcgccgg caggggatct cggtgccggt 42840 

gcgggcgttg ttcaccacgc cgaccccggc cgggctggcc gaggcggtcg gtgatggtgc 42900 

cgtggtggtg ccgccgaacc tgattcccga gggtgcggcg gagctgaccc cggagatgct 42960 

gccgctggcc gatctgaccg ccgacgaact tgctgttgtc gtggattcgg tgcctggtgg 43 020 

tgcggcgaac atcgcggatg tgtatccgtt ggcgccgttg caggagggca ttttcttcca 43080 

tcacatgatg gccgaccggg attcggcgga cgtgtatgtg acgccgacgg tggtggagtt 43140 

cgactcccgg gaccggttgg acggcttcct ggccgccttg cagcaggtcg tcgaccgtac 432 00 

ggatgtgtat cggacgagtg tggtgtggca ggggctgcgc gagccggtgc aggtggtgtg 432 60 

gcggcacgcg cgcctgcccg tcgacgaggt ggtgctgcgg gacgacctcg acccggtcga 43 320 

gcagctgaac gcgctcggca cggcctggat ggacctgtcc gaggcgccgc tggtgcaggc 43 3 80 

cgtcgtcgcc gcccgccccg gcgatccgca gcgctggctc gccgtgctgc gcatccacca 43440 

cctcgtgcag gaccacaccg ccctcgacat cctcctcgag gagctggcgg cgtacctggc 43 500 

cggccgcggc ggcgacctgc ccgagccggt gccgttccgc gagttcgtcg cgcacacccg 43560 

cctcggcgtg ccccgcgagg agcacgagcg ctacttcgcc gggttgctcg gcgacgtcac 43620 

cgagaccacc gcgccgta.cg ggctcctcga. cgtgcacagc ggcggtctcg cctcggcgca 43 680 

ggcccacctg cggctggacg gcccgctcgg ccggcgcgtg gccgccttcg cccgggaaca 43740 

cggcgtcagc ccggcgacgc tcttccacct cgcgtgggcg cgggtgctcg gcacgctggc 43800 

cgggcgtgac gacgtcgtct tcggcacggt cctgttcggg cggatgaact cgggcgccgg 43860 

Q cgccgaccgc gttcccggcc tgttcatcaa cacgctgccg gtgcgggtgc ggctcggcgc 43920 

gcccgtcggc gacgccctcg acggcctgcg tgaccagctc atcgagctca tcgcccacga 43980 

gcacgcgccg ctggccgtgg cccagcaggc cgcgaacctc ttcggccggc cgctcttcac 44040 

ctccatcttc aactatcggt acgcccgggg ggccgagccg gccggcgccg cgctcgacgg 44100 

catccgcctg ctctccgccc gcgacctcac caactatccg ctggcggtgg ccgtcgacgc 44160 

ggagggcgac acgttctcgc tcaccgtcga cgcggtggcg ccggccga.cc ccgtgcaggt 44220 

cggcgagctg ctcgtcaccg cgctgcgcaa cctgacccgg accgccgaga acgcgcccgg 44280 

aacgccgctg gccgcggtcg gcgtgctggg cgaggacgag ctgagccggg tcgtctccgg 44340 

ctggaacgac accgcccgcc gggtccggca ggcgtcggtg cccgagctct tcgcggagcg 44400 

ggtggcggcc gcgcccggcg cgccggccgt cgccgccggg gacctgcgct ggacgtacgc 4446 0 

ggacctcgac gcccgttccg acgcgctcgc gcggagcctg gtggcggccg gggtgaccgc 44520 
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ggagtcgccg gtcgtcgtcg ccctcgagcg ctccgcggac gtgctgaccg cgttcctcgc 44580 
cgtcgcgaag gccggcggtg tcttcgtccc ggtggacctc tcctggcccc gggcccgcgt 44640 
cgacgcggtg atcgccgact gcgccgcctg gatcgcggtg gccgaccggc cgatgaccgg 44700 
cctgaccgtc gtgcccgcca accgggccgg cgatcccgcg gtcgcgctgc cgccccgccc 44760 
cctgccgggc gcggcggcct accggatgta cacctccggc tccacgggcc ggcccaaggg 44820 
cgtggtgacg acccatcaga acgtcgtcga cctggtcacc gaccggtgct ggggcccgac 44880 
gccgcgggtc ctgttccacg ccccgcacgc cttcgacgcc tcctcgttcg agctctgggt 44940 
gccgctgctg accggcggca cggtcgtggt cgcaccgggg gagagcatcg acaccggtgt 45000 
gctgcggcag ctgatccggg cccacgagct gacccacgtg cacgtcaccg cgggcctgtt 45060 
gcgcgtgctg gccgaggacc cgtcgtgctt cgccgggctc accgaggtgc tcaccggcgg 45120 
cgacgtggtc ccggccgagg cggtgcgccg cgtgctggac gccaatcccg gcgtgcgggt 45180 
gcggcagctg tacggcccga ccgaggtgac gctctgcgcc acccagcacg tggtgcgcga 45240 
gcccagcccg gtgctgccca tcgggcggcc gctcgacaac acccgcgtct acgtgctcga 45300 
cgggcttctc cagccggtcc cggtcggtgt caccggcgag ctgtacatcg ccggcgccgg 45360 
cgtggcccgg ggctacgccg acatgccggg caccaccgcc gagcggttcg tcgccgaccc 45420 
gttcaccgcc ggcggccgcc tctaccgcac cggtgacctg gtccgctgga ccggcgaggg 45480 
cgagctggtg ttcgccggcc gggccgacga ccaggtgaag atccgcggct accgcgtcga 45540 
gccgggcgag gtggaggcgg tcctcgccgc gttgccgggc gtcagccagg cggcggtcat 45600 
cgtccgcgag gacgtacccg gcgacaagcg gctggtggcc tacctggtcg cggcgccgga 45660 
gacggtcgag gccgcccgcg cccacgccga gcagcggctt ccgtcctatc tcgtcccgtc 45720 
cgcgttcgtg cagctggacg cgctgccgct gaccggcaac cagaaggtcg accgggcggc 457 80 
gctgccggca ccgctggggt tcgaagccgg tgccggccgg gcgccggcgg acgcccgcga 45840 
ggagctggtc ggcgccgcct tcgccgaggt gctcgacctc ggccgggtgg gccccgacga 45900 
cgacttcttc gcgctcggcg ggcactcgct gctcgccctc gcgctggtgg agcgcctgcg 45960 
ccggcagggc ctgggcgtct cggtgcgtgc cgtcttcgac gcacgcaccc ccgcggcgct 46020 
gacccgccgc ggcgacggcg gtgccgacga ccggccggcg ctgcgggccg gtgcgcggcc 46080 
cgcgcggctg ccgctttcct acgcgcagcg ccggctgtgg ttcctggccc agctggaagg 46140 
accgagcgcc acctacaaca tcccggtcgc gctgcgcctg gagggcgacc tcgaccggga 462 00 
tgccctgacc gccgccctgc gcgacgtggt ggcccggcac gaggtgctgc gcacggtgtt 46260 
cacggtcgcc gacggcgagc cgtggcaaca catcctcgac cccgcgcggg ccgagcccgc 46320 
gttgccggtc gtggacgtgc cggccggccg ggtcgaggag gcggtcgccg aagcggccgc 46380 
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tgaactccgg 
gcgtgcggct 
ccgggctgct 
ccgacggcag 
cggcgtcggc 
cgctcgacgc 
cgcccgcgga 
acgtgctcga 
atctcggccg 
tgccggagct 
agggtgtgtc 
tggtggcggc 
aggtgccgac 
tgtcgtggcc 
tggccgaccg 
ctgccgagtt 
gctcgacggg 
ccgacacgtg 
cgtcgtcgta 
ggcggagcat 
tgcacgtcac 
aggtgctcac 
acccgggcct 
agcaggtgct 
atgtcctgga 
ccggtgccgg 
tcgccgaccc 
ccgatgacgg 
atcgggtgga 



50100 
50150 
50220 
50280 
50340 
50400 
50460 
50520 
50580 
50640 
50700 
50760 
50820 
50880 
50940 
51000 
51060 
51120 
51180 
51240 
51300 
51360 
51420 
51480 
51540 
51600 
51650 
51720 
51780 
51840 



gccgggcgag 
cgtccgggag 
cgaggcgtat 
ggacgcgctg 
cgtggagtcc 
gtttgccgag 
tgggcattcg 
ggtgtcggct 
cgagggcggc 
gtcgttcgcg 
caacatcccg 
actgcacgat 
cgagccgtgg 
cgccgacggg 
ggagatcccg 
ggtcgtgcat 
cgtcgcctac 
gtacgccgac 
cgccatctcc 
accgctgccc 
gttcaccgtg 
caccgtcttc 
caccgacatc 
catcggcttc 
caccgacctg 
ccccttcgaa 
cttccaggtc 
cggcctggag 
gagcctgagc 
cgcggcggcc 
gctgctcgcc 



gtcgaggctg 
gacaccccag 
gcgcaggagc 
ccgctgacca 
ggcgcgggcc 
gtgctcgacc 
ctgttggtgg 
cggctggtct 
acggcccggg 
caacgccggt 
gtcgcgctgc 
gtggtcgccc 
caacacatcc 
cttccggagg 
ctccgggcgg 
cacatcgccg 
gccgcccgga 
tacgccctgt 
cagcaggtcg 
gccgaccacc 
cccccggccg 
atgaccgtgc 
cccatcggcg 
ttcgtcaaca 
ctgcaccgca 
aaactcgtcg 
atgatgaccc 
acggcggtgc 
gaggcgtacg 
gacctgttcg 
gtgctgcccg 



tcttggccgc 
gggataagcg 
gccttccggg 
gcaaccagaa 
gggcgcccgc 
tggatcgggt 
tgcggttggt 
tcgatgcgcg 
agO'cggtgcg 
tgtggttcct 
aactcgacgg 
ggcacgaggt 
tcgacacgcc 
cggtggccgc 
tgctgctcgc 
ccgacggctg 
tccggggcga 
ggcagcgcga 
cccattggcg 
cccgtcccgc 
tgcaccacca 
aaaccgccct 
tcgccgtcgc 
ccctcgtcct 
cccgcgacac 
aagacctcgc 
tgcagagcac 
tgtcgccggg 
acgacgacgg 
agcacggcac 
ccgatcccgg 



gcatccggac 
gctggtcgcc 
ctacctggtc 
ggtcgaccgg 
cgacgcgcgt 
cggtgtcgac 
gggccgtatt 
gactccggcc 
ggcgcgggtg 
gtcccagctg 
cccgctcgat 
gctgcgtacc 
gtcggtgagc 
ggcggccgcg 
caccggcgcc 
gtcgatgcag 
ggcgccggcc 
cgtgctcggc 
gcgacagctc 
cgaggccacc 
actcgccgaa 
cgccgtcctc 
cggacgcacc 
acgcaccgac 
caccctgcac 
acccacccgc 
cgggcgggcc 
cggcgtcgcc 
ccgcccggcg 
cgccgagcgg 
cgcccggctc 



gtggctcagg 
tacgtcgtcg 
ccgtcggcct 
gccgcactgc 
gaagagctga 
gacgacttct 
cggcaggtgt 
ggtgtggtgg 
cgtcccgcgc 
gacggcacga 
cgggacgcct 
gtcttcaccg 
gtccccgtca 
cacaccttcg 
gaccggcacg 
cccctcgccc 
tggaccgccc 
tccgagcacg 
gccggagccc 
taccgcggcc 
ctcgcccgcc 
ctgtccaaac 
gaccccaccc 
ctgaccggca 
gccttcaccc 
tccctcgccc 
ggcgaggcgg 
gccaaggtcg 
ggtctcgccg 
atcgccggtt 
ggcgacgtgg 



tggcagtggt 
gcggcgatgt 
tcgtccatct 
ccgcgccgtc 
tgtgtgccgc 
tcgctttggg 
tcggggtgga 
cccgcttgtc 
g^9"tgccgtt 
gcacgaccta 
tcaccgcggc 
tcgccgatgg 
tcgaggtgcc 
acctgagccg 
tgctggtgct 
gggacctcgc 
tgcccgtcca 
acccggacag 
ccgacgagct 
acaccgtgga 
gcaacggcgt 
tcggcgccgg 
tcgacaacct 
accccaccat 
accaagacgt 
gccaccccct 
ccgagctgcc 
acctcgacct 
gaacgctcgt 
acctcgcgcg 
acctgctcga 



51900 
51960 
52020 
52080 
52140 
52200 
52260 
52320 
52380 
52440 
52500 
52560 
52620 
52680 
52740 
52800 
52860 
52920 
52980 
53040 
53100 
53160 
53220 
53280 
53340 
53400 
53460 
53520 
53580 
53640 
53700 



cggcgaggag cggcggctgg tcctcaccgg ctggaacgac acgacggcgg ccgtgccggc 53760 

ggtggcggtg cccgagctga tcgagcggcg tgccgccgcc gaaccggagg ccggcgccgt 5382 0 

ctggtgcggc gacacgcacc tgcggtacgg cgagctgaac gcccgcgcga accgcctcgc 53880 

ccggctgctc gtggagcgcg gggcgggacc cgagtcgatc gtcgcggtct gcctggaacg 53940 

ctcggccgac ctcgtcgtca cgctgctggc cgtgctgaag accggggccg cctacctgcc 54000 

gatcgatccc ggatatccgg ccggccggat cgcctacatg ctcgccgacg cccgccccgc 54060 

gctgctcgtc acgagcccgg cggtcgcctc cggtgacagc ctcccggacg gtggcgcgca 54120 

acggatcgtc ctcggcgatc cggacaccgc ggcggccctc gacggcctcg ccggcaccga 54180 

cctgctcgtc tcggagcggc gcggcgtcac gcacccggca catccggcgt acgtcatcta 54240 

cacctccggg tcgaccgggc gccccaaggg tgtcgtcgtg ccgcacgggg ccctcacgaa 543 00 

tttcgtggcg gcgatgagcg accggctcgc gctgggcgcc ggcgaccggc tgctcgcggt 54360 

caccacggtc gccttcgaca tccacgtcct ggagctctac gtgccgctgg tcggcggcgc 54420 

cggagtggtc gtcgccgagg acgccgtggt gcgcgacccg gccgcggtcg ccgcgctcct 54480 

cgaccggcac gccgtgacga tcgtgcaggc caccccggcg ctgtggcagg cgctgctcgc 54540 

cgggcacgcc gacgccgtcc gcgacgtgcg gctgctcgtc ggcggcgagg cgctgccgcc 54600 

cgcgctcgcc ggccggatgg ccgcggccgg tcgcggtgtc accaacctgt acggcccgac 54660 

cgaggtcacc gtgtgggcga ccgtcgccga cctcggcgcg agcccggccg ggccggtgcc 5472 0 

gatcggcacg cccctgcgca acacccgcgc cttcgttctc gacgacgcgc tgcgcccggt 547 80 

gccgccgggc gtgccgggcg agctctacct cgccggcgat cagctcgcgc ggggctacca 54840 

cggccgggcc ggcctgaccg ccgagcgctt cgtggccgac ccgttcggcc gcggtgagcg 54900 

gatgtaccgc accggcgacc gggtccggtg gacccgcggc ggcagcctgg agttcctggg 549 60 

ccgcgtcgac gaccaggtca agatccgcgg tttccggatc gagctgggcg aggtcgaggc 5502 0 

ggcgcttgcc gcgttcgggc cggtggcccg ggcggccgcc gccgtccgcg aggacgtgcc 55080 

gggcgaccgc cggctcgtcg gctatgtcgt gccggccgcc ggcgagccgg agcccgaccc 55140 

ggcggcggtc cgcgcgcacg tcgccgccca gctgcccgcc tacatggtcc cgtcggcggt 552 00 

cgtggtcctg cccgacctgc cgctgaccgc gaacggcaag ctcgaccgca aggcgctgcc 552 6 0 

ggcacccgac tacggcgccg cctccgccgg ccgggcaccg gccgacgagc gcgaggcgct 55320 

catctgcgcg gtgttcgccg agacgctcgg cgtgaccgac gtcgcagccg atgccgactt 55380 

cttcgccctg ggcggccatt cgctgctggc cgtgtcgctg gtcgaacggt tgcgcgagca 55440 

cggcatcgcg gttccggtcc gcgccctgtt ccagtcgggc acccccgagg gcctggccgc 55500 



cgcggcccgc 
cgccaccgcg 
ccgggtggtc 
gccgctgcag 
gctgccggcc 
gcaacacgtg 
cgagccggtg 
cgcgggcgcc 
ccgggccccg 
gctgctccag 
gatccaggcc 
ctacgtcgcc 
gctgctcggt 
gcggccgacc 
cacggcgagc 
gctgggcacg 
cggcgccggc 
ggtgcgcctc 
cgagctgatc 
cggcgggacg 
cgacaccggc 
ctacccggtg 
ggcccagccg 
caccgacgcg 
ggcggagcgc 
gacgctgccc 
cgccgagaac 
ctcgctgatc 
gccgggcctg 
cgatcccggc 
gctgctggcc 



gccgagggcc 
ctcacgccgg 
gccgccgtgc 
gaggggctgc 
gtcctcggat 
atcgaccggc 
caggtggtct 
gacccggtgc 
ctgctgcggg 
atccaccacc 
ttcctcgccg 
cgggcccggc 
gacgtcaccg 
ggaaaggccc 
gagctgggcg 
cttgccggcc 
gcccggtccg 
gccgccgccg 
ggtcacgagc 
ccgctgttca 
gacgaggaga 
gccgtctccg 
gccgcaccgg 
ctcgcgggca 
cgggaggttc 
gcgatcgtcg 
gaccggctca 
gcccgcgggg 
gtggtggcga 
tatccggcgc 
acggccggca 



cggacgagcc 
cgatgctcac 
ccggcggggc 
tcttccacag 
tcgattcgcg 
acgacacgta 
ggcgccgggc 
aggaactgct 
tgcgctgcgc 
tcgtccagga 
gccgcggcgg 
tcgccggcgc 
agagcaccgc 
atcgcgaggt 
tgagcccggc 
gcgacgacgt 
ggcgagccct 
gcagccgcga 
acgccccgct 
cctcgatcct 
tcgagggcat 
tcgacgacga 
accgcgtcgc 
cgcccgacgt 
tccggacgcc 
gcgagtgggc 
cgtacgccga 
tcggtcccgg 
tgctggcgat 
cccggctggc 
ccgcggacgc 



ggccgtgccg 
cctcgtcgac 
cgcgaacgtg 
cctgatggac 
gtcccgcctc 
ccggaccgcg 
cacgctgccg 
cgccaccgcg 
ggcccggccg 
ccacgccacg 
cgagctcgcc 
gccggccgag 
cccgtacggg 
cgaccggcgg 
gaccgtgttc 
cgtcttcggc 
cggcccgttc 
gacgctggcc 
gacgctggca 
caactaccgg 
cgagctgctc 
cggttcgggc 
cgagctgctg 
ggagccggcg 
gaacgccacg 
gcggaccacg 
gctggacgct 
tgccgtcgtc 
cgtcaaggcc 
ccggatggtc 
cgtgcccgcc 



gccaacggca 
ctcgacgccg 
gccgacgtct 
ggcggcgacg 
gacgcgttcc 
gtggtgcacg 
gtcgaggagg 
ccggtcgagt 
gacggcggcg 
ctcgacgcga 
gcgcccgagc 
caccgggcgt 
ctgaccgacg 
ctggccgccc 
catctcgcct 
accgtcctgc 
atcaacaccc 
gggctgcgcg 
caggccgcga 
caggggccgc 
tccaccgagg 
ttccggctca 
cacacctgcc 
cggatcgacg 
gcccgcgacg 
cccggcgcga 
cgcgccaacc 
ggcatgctcc 
ggcggcgcct 
gaggacgccg 
gggccgcagc 



tcccggacgg 
aggagatcgc 
atccgctcgc 
acgtgtacgt 
tggccgcgct 
acggcctgcg 
tgaccctgac 
tcgcgctcga 
gatggctggc 
tgctcgccga 
cgttccgcgg 
acttctcccg 
cgcgggacgc 
gcgtgcgggc 
gggcgcgggt 
tgggacggct 
tgccggtgcg 
cccagctggc 
gcggcgtgcc 
ccgccggcga 
aacgcagcaa 
ccgtcgacgc 
tgcaccggct 
tgctcggcga 
tggcggcggc 
ccgcggtcac 
gcctggcccg 
tgccccgctc 
acctgccgct 
cgccggcgct 
gactgctgct 



55560 
55620 
55680 
55740 
55800 
55860 
55920 
55980 
56040 
56100 
56160 
56220 
56280 
56340 
56400 
56460 
56520 
56580 
56640 
56700 
56760 
56820 
56880 
56940 
57000 
57060 
57120 
57180 
57240 
57300 
57360 



ggacgacccc 
ggagcgcacc 
gacgggacgc 
ctccacctgc 
cgacgcggcc 
accggcggag 
caagctgttc 
cgccggcctg 
gcttctcgac 
caccatcacc 
gatcggcgtg 
gcccgtgggg 
cggacgaccg 
gatgtaccgc 
ccgcgccgac 
ggtcctgtcc 
cggcgccaag 
gctggccgag 
cgcgacggtg 
gctgccggcg 
gatcctctgc 
cttcttcgcc 
cgtcttcggg 
ggccacccgg 
ggagaacgtg 
acaggacggc 
ggcgctcgcc 
cccgctcgtg 
gctcggcggc 
cggcaccttc 



ggcaccgcgg 
cacccgctgc 
ccgaagggcg 
cttcagctgg 
accttcgaga 
acgctctcgg 
ctcaccaccg 
aaggcggtct 
cgggtgcccg 
acccttcatc 
cccatctccg 
gtcgccggtg 
gcgccgaccg 
accggcgacc 
aaccaggtca 
ggacatccgg 
cggctggtcg 
tccgtgcgcg 
gtgctcccgg 
ccggacgtgc 
ggcgtcttcg 
agcggcggcc 
gccgagctgc 
ctcggcgcat 
cccctgtcgt 
acctacacca 
gcggccctgc 
gacggcgaac 
ggtgacgtcg 
gatctggccg 



cggagctggc 
gccccgggca 
tgctggtgcc 
caccggacga 
tctggggcgc 
tggccgagct 
gcctgctgca 
acaccggcgg 
gcctcgagct 
gcgtacgccc 
acacccgggt 
agctctacac 
cggagcgctt 
tggtgcgctg 
agatccgggg 
ccgtggcacg 
catacgtcgt 
cctacgccgc 
acctgccgtt 
cgggcgggcc 
cccaggtgct 
attcgctgct 
cgatccgggc 
ccgcgccgcg 
acgcccagcg 
tcccgctcac 
gcgacgtcct 
ccgtccagcg 
cggccgccga 
ccgagatccc 



ccggctggac 
cccggcgtac 
gcacgccggc 
cgtcctgccg 
gctgctcaac 
gcgggccttc 
cgaggtgatc 
tgacgtgctc 
ctacaacgcc 
ggaggacctc 
gtacgtgctc 
ctccggcatc 
cgtggcgtgc 
gaccgccgac 
cttccgggtg 
ggccgcggtg 
gccggcccgg 
ccggcaggtg 
gaccagcagc 
gggccgcgcc 
cgggctgccc 
ggccacccgg 
cgtcttcgag 
gcgactcgcg 
gcggctgtgg 
cgtgcggctc 
ggagcgccac 
ggtgctgccg 
cctcggcgcc 
ggtgcgcgcc 



ggcgacccga 
ctgatgttca 
atcgaccgca 
cacctctcgt 
ggcgccaccc 
ctcgcggacc 
gacgccgacg 
tccccggcgc 
tacggcccga 
gacgcgggca 
gacgacgcgc 
gggctggcgc 
ccgttcgcac 
gggcgcctgc 
gagccgggcg 
ctggcgcgcg 
ccggacgagg 
cccgactatc 
ggcaaggtcg 
gccggcacgc 
acggtcggcg 
ctggtcagcc 
gcgccgacgc 
ctcggcgaac 
ttcctcgacc 
gacgggccgg 
gaggtgctgc 
gtgcacgaca 
gcggtcgccg 
tggctgttcc 



tccgcgacga 
cctccggatc 
tggtccgccg 
cggtgtcgtt 
tcgccgtcgc 
ggggcgccac 
tgaccgccct 
actgccggtc 
ccgagaacac 
cgggcgtacc 
tgcggccggt 
acggctacgc 
ccggtgagcg 
tgttcgccgg 
agctcgagac 
aggacacgcc 
e-cggggsicgc 
tgatgcccgc 
accgggccgc 
tcaccgagga 
tcgacgacga 
ggctgcgtgc 
cggccacgct 
gcgcccggcc 
gcctggaggg 
tcgaccgggc 
ggaccgtctt 
ccggcttcac 
aggccacggc 
gcgccgggcc 



57420 
57480 
57540 
57600 
57660 
57720 
57780 
57840 
57900 
57960 
58020 
58080 
58140 
58200 
58260 
58320 
58380 
58440 
58500 
58560 
58620 
58680 
58740 
58800 
58860 
58920 
58980 
59040 
59100 
59160 





accc tcgtgc 


tgctggtgca 


ccacgtcgcc 


ggcgacggct 


ggtcgatgac 


59220 


accac tacfcc 


cgcgacatcg 


ccaccgccta 


cgacagccgc 


cgcgagagcc 


gggcgccgca 


59280 


a t g g g" a g c c g 


ctgcccgtgc 


agtacgccga 


ctacgcgctc 


tggcagcgcg 


aactgctcgg 


59340 


cgccgaggac 


gatccggaga 


gtttgctgtc 


gcggcagctg 


gcctactggc 


gggacgcgct 


59400 


ccracCTcrccrta 


ccggaggagc 


tggacctccc 


ggccgaccgg 


ccgcgcccgg 


ccgaggccac 


59460 


gcaccgggga. 


cacgaggtgc 


ccgtgcgggt 


gccggccgag 


gtgcaccggc 


gcctggccga 


59520 


cr c t cr CT c c c cr cr 


tccgagggcg 


tgaccgtgtt 


catggtgctg 


caggccgcct 


tcggcacgct 


59580 


gc tgtcccgc 


ctcggcgccg 


gcgccgacat 


cccgatcggc 


acggcggtcg 


ccggccgcac 


59640 


cgaccaggcc 


ctcgacgagc 


tcgtcgggtt 


cttcgtcaac 


acgctggtga 


tccgggccga 


59700 


cc tgtccggc 


gaccccacct 


tccgggagct 


gctcggccgg 


gtgcgcgcca 


ccggcctgtc 


59750 


cgcctacgag 


caccaggacg 


tcccgttcga 


gcggctcgtc 


gaggtgctgg 


caccggcccg 


59820 


atcgcfccgcc 


cggcacccgc 


tcttccaggt 


catgctcacg 


ctgcagaaca 


ccggccgcgc 


59880 




gaccaggccg 


tcccgccggc 


cgccggatcg 


gccgcggcca 


agttcgacct 


59940 


cgagatcagc 


atcgcggaga 


cgttcgccgc 


cgatggcgag 


ccggccgggc 


tcagcggcgt 


60000 


tctcatcgcc 


gccgccgacc 


tgttcgagcc 


ggccaccgcg 


gccgcgttcg 


ccgaacggct 


60060 




ctggccgcgg 


ccggcgccga 


tccgcggctg 


cgggtcagcc 


aggtcgacat 


60120 


CC tcagcgcc 


aaccracf c Cf c Q" 


aggccgtcct 


gtccggcggc 


aacggcggca 


ccgcgccggt 


60180 


tcccgtcacc 


accgtcccgg 


cgctcttcgc 


cgagcaggcc 


cgccggaccc 


cgggcgcggt 


60240 


QQC era cere tcr 


agcgagggga 


tgtcgctcac 


ctacgccgat 


ctcgccgccc 


gcgtgaaccg 


60300 


gctcgcccgg 


cacctggtga 


gcctcggcgc 


cggaccggag 


accgtcgtcg 


gtatcgccat 


60360 


gagccgcggc 


ctcgacatgc 


tggtggcggt 


cctcgcggtc 


gggcaggccg 


gcgccgccta 


60420 


cctgcccgtc 


gacccgtcct 


acccggacga 


gcgcaaggag 


ttcatgctca 


ccgacgccgg 


60480 


cgccgcgtat 


gtgctcacct 


tggcctcgga 


cgccgaccgc 


gtgccgccgg 


gaaccccggc 


60540 


ccrccCTCcGtcr 


gtcctggacg 


agcccgtgac 


ggccgcgcgg 


atcgccgggc 


tcgatccggc 


60600 


cgacc tgacc 


gacgccgacc 


gggtggcgcc 


gctgctgccg 


gcccaccggg 


cgtacgtcat 


60660 


c ta.cacc tec 


ggatccaccg 


gccggcccaa 


gggtgtcgcc 


gtcgagcacc 


gcaccgtggt 


60720 


caacc tgctg 


tcctgggcgg 


ccgggcggtt 


cggcggcgcc 


gacttcgccc 


ggacgctcgc 


60780 


cgccacctcg 


ctcaacttcg 


acgtctcggt 


cttcgagatc 


ttcgggccgc 


tggtgtccgg 


60840 


cggcagcatc 


gagatcgtca 


ccgacctgct 


cgccctggcc 


gacccggcc t 




60900 


ggaggccagc 


ctggtcagcg 


gcgtgccgtc 


ggcgttctcg 


cgggtcctcg 


accggggcga 


60960 


catcgccgcg 


cgcacccgca 


gcgtggtgct 


ggccggcgag 


gcgctgaccg 


ccgacgtggt 


61020 



gaacgccacc 
ggcgaccgtc 
gatcgggcgg 
gccgccgggc 
gggccggccc 
gcgcatgtac 
cggccgggcc 
ggccgtcgtg 
gcccggcgac 
ggcgctggcc 
ggcggccgtg 
tgcgctgccc 
cgaggaggtg 
cgacgacttc 
gcgccgccgc 
cctggccgag 
cgcgcggcac 
ccaggtcgtg 
gccgctccag 
ctacatcatg 
cgccctccag 
cctgcgcgag 
gctcgatccg 
ggacctgaac 
acggctggcc 
cctcggcgag 
gttccgcgac 
cttcgccggg 
gcaccgcgac 
ccggctccgc 



cgtgccgccc 
tactcgaccg 
ccggtcacca 
gtggtgggcg 
ggcctgaccg 
cgcaccggcg 
gacgaccagg 
gcgcgccagg 
gcccgcctgg 
gccaccgtcc 
gtggtcctcg 
gcgcccggcc 
ctctgcgagg 
ttcgcgctcg 
ggcgtgacgg 
gcggccgcgc 
atcacgcccg 
gccactgtgg 
cagggcatgc 
cccgcggtcg 
cacgtgatcg 
ccggtgcagg 
gccggcggcg 
cgggcgcccc 
ctgctgcggg 
gtgcaggcgt 
ttcgtggcgc 
ctgctgggcg 
ggcgtcggcc 
gccgtggccc 



tgcccggtgt 
cctggcacac 
acacccgcgc 
agctctacct 
gcgagcgctt 
accgggtccg 
tcaagatccg 
ccggcgtggc 
tcgcgtacgt 
gctcggacac 
acgagctgcc 
tggccgaggc 
tcttcgccca 
gcgggcactc 
tcggggtgcg 
ccaccacggt 
gcctgctgcc 
acggcggcgc 
tcttccacca 
tggagttcga 
accgcaacga 
tggtctggcg 
atcccgccgc 
tgctcgacct 
ttcaccacat 
tcctggccgg 
agacccgcgg 
acgtcaccga 
tggtgcgcca 
gccggctcgg 



ccgggtggcc 
cgaccgggac 
ctacgtcctc 
ggccggcgcc 
cgtggcctgc 
gtggaacgcc 
cggcttccgt 
ccgggcggtg 
cgtgccggcc 
cgcgcgcgag 
cgtcacggcc 
gggcagcggc 
ggtgctcggc 
cctgctggcc 
cgcgctcttc 
cgccgttccg 
gctcgtggag 
cgccaacgtg 
cctcatggcc 
ctcggcggac 
cgtctaccgc 
ccgggcgccc 
ccagctgcac 
ccacgtggcc 
ggtgcaggac 
ccgcggcgac 
cggggtgccg 
gcccaccgcg 
ggaacgcccg 
ggtgagcccg 



aacatctacg 
gtgaccggcg 
gacgaccgtc 
cagctggccc 
ccgttcggcc 
gacggcgacc 
atcgagccgg 
gtgctggccc 
gaccgggacg 
ctgccggcgt 
aacggcaagc 
cgcgggccgg 
ctgccctcgg 
gtctcgctgg 
cagacgccca 
cccaacctca 
ctggagcagg 
gccgacatct 
ggcgacgacg 
cgcttcggcg 
accggcgtgg 
ctgcccgtga 
gccgccgccg 
gcccggcccg 
cacatggggc 
gagcttcccg 
gaggccgagc 
ccgtacggcc 
ctcgacggtg 
gcgaccgtca 



ggccgaccga 
gcgccgcgcc 
tcacgccggt 
gcggctatct 
cgggcgggga 
tggtcttcgc 
gcgaggtgca 
ggagcgactc 
ccgaccgccg 
acctggtgcc 
tcgatcgccg 
tcacccaccg 
tcggcgtgga 
tggagcagct 
cggtcgccgg 
tccccgagga 
ccgagatcga 
atccgctcgc 
gcgaggacgt 
ccttcgtcga 
tctgggacgg 
ccgaggtgac 
gcgcccggat 
aggacggcca 
tcgaggtgct 
atccgctgcc 
accggcggtt 
tgctcgacgt 
aggtggtggc 
tgcacgtcgc 



61080 
61140 
61200 
61260 
61320 
61380 
61440 
61500 
61550 
61620 
61680 
61740 
61800 
61860 
61920 
61980 
62040 
62100 
62160 
62220 
62280 
62340 
62400 
62460 
62520 
62580 
62640 
62700 
62760 
62820 



ctgggcgcgc gtgctcggcg tgatctccgg ccgcgacgac gtggtcttcg gcacgctgct 62880 
gctgggccgg ttcagcaccg gcgccgaccg ggtgcccggc ccgttcatca acacgcttcc 62940 
ggtgcgggcc cggctcggcg gcacgggcgc cgcggcggcg gtggcggaga tgcgccggct 63000 
gctggccgag ctgctcgagc acgagcacgc gccgctgacc acggcgcagc aggccagcgg 63060 
actctccgga aacctcccgc tgttcacggc gctgttcaac tatcggcaca acacgtcgcc 63120 
gggtgcggac ccgtcgcccg cggccggccc gaccgagggc atccgcccgg tctccatgcg 63180 
ggagcgcacc aactatccga tctcggtggc ggtggacgac gacggcgagg gcctcggcgt 63240 
ggcggtcaac gcgatcccgc cggtgcggcc ggaggcggtg tgcgagctcg tggcgaccgc 63300 
gaccgagagc ctgacctcgg cgctggagct gttcctcgac ggcggtccgg acaccgcggt 633 60 
cggcgagctc gacgtgctgc cgccggggga gcggtcgcgg ctgctggtgg agtggaacga 6342 0 
cacggctcgt ccggtggtgg agtcgtcggt gccggcgttg ttcgccgagc gggtggcggc 63 48 0 
cgcgccggat gcggtcgcgg tggtgggcga gggtgtgtcg tggtcctatc gcgagcttga 63540 
ccgtcgctcg gatgtgctgg cgcggagtct ggtggcggcg ggtgtgggcc tggagtcgcc 63600 
ggtggtggtg gccctcgaac ggtccgccga cgtgctgacc gcgtttctcg ccgtcgcgaa 63660 
ggccggcggt gtcttcgttc cggtggactt gtcctggccg cagacgcgta tcgatgcggt 63720 
gatcgcggac agccggccgg ttctggtgtt ggacagcgtg gatctgccgg ccgcggaggc 63780 
cgacctgccg cgggtgccgg ccggtgcggg cgtgtatcgg atgtacacct cgggttccac 63840 
gggccggccc aagggtgtgg tgaccaccca ccagaatctg gtggatctgg cgaccgacac 63900 
gtgttgggga tcgacgccgc gggtgttgtt ccacgccccg cacgccttcg acgcgtcgtc 63960 
gtacgaaatc tgggtgccgt tgttgaatgg cggcacggtc gtggtggccc cgcggcgcag 64020 
catcgacgcc accgtcctga gggacctggt ccgcgggcat gagttgacgc acgtgcatgt 64080 
gaccgcgggc ctgttgcggg tgctggaccc gtcgtgcttc gcggggctga ccgaggtttt 64140 
gaccggcggg gatgccgtgt cggcggaggc ggtgcgccgg gtcaaggaag cgaacccggg 642 00 
tctgcgggtg cgccagttgt acggcccgac cgaggtgacc ttgtgcgcca cgcagcatct 64260 
gctggatgac ggggtgccga tcgggcggcc gttggacaac acccgcgtct acgttctcga 64320 
cgacctcctg cggccggtcc cgacgggtgt ggtgggggag ctgtatgtgg ccgggtcggg 64380 
tctggcgcgt ggctatgcgg gcatgccggg tttgacggcc gagcgatttg tcgccgaccc 64440 
gttctcggtg ggtggtcgcc tctaccgcac cggtgatctg gtccggtgga ccgacgacgg 64500 
tgtgctgcac ttcgccgggc gggccgatga tcaggtgaag atccgcggct atcgggtgga 64560 
gccgggcgag gttgaagcgg ttctggctca acaccccgac gtcagccagg tcgcggtcgt 64620 
cgtccgagag gacgcgccag gggataagcg gctggtcgcc tatgtcgtcg gcggggatgt 64680 



cgaggcgtat gcgcaggagc gccttcccgg ctacatggtt ccgtcggcct tcgtccatct 64740 

ggaagcgctg ccgctgaccg cgaaccagaa ggtcgaccgg gccgccctgc ccgcgcccga 64800 

gcgggagacg acgacaccgg gtaaggcacc cgcccccgga ccgctcggca acctcgagga 64860 

gtcgatgtgc caggcgttcg ccgaggtgct cggcctcgac agcgtcggcc cggacgacga 64920 

cttcttcgcc ctgggcggcc actcgctgct cgccgtcgcg ctcgtgcagc ggctcaaggc 64980 

acgcggtgtc gccgtcacgg tgcaggacat catggccgcg cccacggtct cggagctgat 65040 

gggctcgctg agcatgtcgt cgatccggga ctccctcggc acgctcctgc cgatccggcg 65100 

caccggcgag ctgccgccgc tgttctgcgt ccatccggcc ggcgggctca gctggtgcta 65160 

cctgccgctg gcccggcacg tgccggccga ccgcccgatc tacggtctgc aggcgcgcgg 65220 

cgccgacggc cgggagccgc tcgcaccgtc gctgcgcgag atggccgccg actacgtgag 6 5280 

ccggatgcgc gccgtgcagc ccgaggggcc gtaccacgtg ctcggcttct ccttcggcgt 65340 

ggcgcccgcg cacgagatcg ccgtccagct gcgcgagcag ggcgccgagg tcgtgctggt 65400 

gctcatggac tcctatccca tggaggatgc ggagtccggc gagcaggcgg ccgacgagga 654 60 

ggagctgccg tgggaggagc tcatcgaggc cgagttcggc cgggtgctcg gcggcttctc 65520 

ccgcgacgaa ctggcggcct tcgccgccgt cttccgcaac aacaccaaga ttcgcgcacg 65580 

ccaccggctg ggccgcttcg acggggacgc cctgctgatc gcctcgaccg acagcgcacc 65640 

cgacggcgag tccaacacct ggcggtgggc gccgtacatc accggtgaga tcacccaggt 65700 

ggtgctcccc tgcgagcaca ccgacctggt acgccccgac atgctcgcgc tgctctggcc 65760 

ggccgtcgag gcgtggcagg ccgggcggca ccgaccttag tcaccacagc gtcgagagga 6582 0 

ccgacatgca gaagatcccg ctcgtgtgtg tgccgttcgc cggtgccggc gcctcgttct 65880 

tccacccgtg ggccgagctc gccgggccgg accggccgat cgtcgcgctc cagcttccgg 65940 

gccgggagtg gcggctgctc gacgaaccgt acgcggacgt cgtcgcggcg gccgcggacc 66 0 00 

tggcgctcac cgtcgccgac gaggtgggcg cggggggccg ggtggcgctc ttcgggcaca 66060 

gcctcggcgc cgtcctcgcg tacgagatag cgcacgcgct ggtgcgcgac ggcgaggtgg 66120 

gcgtggagcg gctcttcgtc agcggctcgc ccgatccctg gacccctcgc accaaccggg 66180 

cgagcggcct ggacgacgag gagttcctgc tgcgggtgcg cgagttcgcc ggttacgacc 66240 

acgaggcgct cgccgatccg gacatgcgcg agctgatcct gcccgcgctg cgcgccgacg 663 00 

tcgagatgca cgagagctac gtggcgggca gcgccgatcc gctgcccgca cccgtcaccg 663 60 

cgctgcacgc ccgcgacgac gcgctggtct ccgccgagca gacggccggg tggagcaagg 66420 

ccaccagcgg cccgttccag ctggtcgagg tggacggcgg ccacatgtac ctcaccgagg 66480 
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acccggccgg 
aacccgtgcg 
gcgcctgcgc 
ccgacctgcc 
acctctgccg 

cggcgctcac 
tcaacaacgc 
agtggcagct 
gccggcacat 
aggtcggcta 
cccgggccgc 
gttcggtgcg 
tcgacgtgcc 
tcatcgagcc 
aggtcaccgg 
gacgtgccca 
gcgcttgccg 
gcccgcatgc 
cgccgccgcg 
ctgctcgcct 
gaccgcgacg 
gcggaaccgg 
ccggacgacc 
tggcccgccg 
ggcgcggcgg 
gcctcgggct 
ctcgccccgc 
gtctcccgga 
tcgccggccg 
gccgatgacg 
cctgcgcccg 



cctgctgcgc 
cttgaccggc 
cgtggccttc 
gggggtgccc 
caagcagggc 
cgcggcggcc 
gggcatcgcc 
gatgatcgac 
gaccgagcgc 
ccggcacttc 
cgcgctcgac 
cgacgatccg 
ggtcgccgag 
ggacgacgtc 
ctccgtcgtc 
agtcccagcc 
tgcgcctggc 
cgttctggca 
agcgcgaact 
acgcggacgg 
cgctgatcgc 
acgcgccgcc 
ggtgggccga 
cgctcgccaa 
cgggcttggg 
ccggcgccgc 
cggacgagta 
cgcccggcgg 
ccgccgaagc 
gcgcgaccgc 
cgccgccggc 



ctgatcgccg 
aagaccgcca 
gccgccgagg 
tatccgctgg 
gccgcggtgc 
gatcgggcga 
gcgccgtccg 
gtcgacctct 
cgctcgggca 
gccggctacg 
tacgcgccag 
cagttcgagg 
cacgagcaga 
gccaacgccg 
acggtcgacg 
cgccaccaga 
cggaccgatc 
cgagcacgtg 
cgcccgcccg 
ctcggccgac 
cctggcccgc 
gccctcggcc 
gctgcgcgtg 
ggtcctcgcc 
cgcggcggcg 
cgcggtcccc 
cgtgcccttc 
cggcaccgag 
gttcgcccgg 
cgagcccacg 
tgcggctcgc 



ccgacctgga 
tcgtcaccgg 
gagccgacct 
gcaccgtggg 
tcaccgtccg 
tcgaccgctt 
gaaaggtcac 
ccggcgcgtg 
gcatcgtcaa 
tcgccgccaa 
cgaaggtgcg 
gccggatgct 
ccttcctgca 
cgatctggct 
gcggattcac 
accgcggcgc 
gacccggcgc 
gcggcccggc 
gtgccgccgg 
ctggtgctgg 
ccggagcggg 
gcgcccgcct 
ccggcgcgcg 
cggtacgagc 
ggctcgggtg 
ggcccggtgg 
ctggcgccca 
ctgcggtgcc 
atgctggccg 
ccgccggccg 
acgctgaccg 



ccgtgactga 
cgcggcccgc 
ggtgctcctg 
ccagctggag 
ggcggacgtg 
cggcggcatc 
ccacgagatc 
gcgcatgacg 
catcgcctcg 
gcacggcatc 
ggtcaacgcc 
gtcggagatc 
ggcgcagccc 
cgcctccgac 
cacgcgctga 
ccggcgccgc 
cgatcgagcg 
cgggcgatga 
agcccggtgc 
tggcccgccg 
cgccgcgcgg 
ggggcctggg 
gcccggccga 
cgggcgcggc 
tggcggcggg 
cgctggcctt 
cccacccgct 
gccaccggct 
cggcacacgg 
cacccgcacc 
ggctcttcgc 



acccgaggag 
ggcctcggcc 
gaccgcgcgg 
cacaccgccg 
cgcgacctcg 
gacgtgctcg 
accgaggacg 
gcggcggtcg 
acggccggtc 
gtcgggctca 
gtctgcccgg 
gcccggtcgc 
atgaacgccc 
gaatcgcgcc 
acggggagaa 
cgagtgccac 
gcggctcgcc 
ggccgcgctg 
gcgggcggtg 
cgaccgcctg 
gcgcaaaccg 
cgacggcggc 
cccggcgcgc 
tgcgggctcg 
ctccagcgcc 
cgacggcgac 
caccgtgcag 
cggcgcggtc 
cgagccgccg 
cgcacccgca 
cgagcaggtc 



66540 
66600 
66660 
66720 
66780 
66840 
56900 
66960 
67020 
67080 
67140 
67200 
67260 
67320 
67380 
67440 
67500 
67560 
67620 
67680 
67740 
67800 
67860 
67920 
67980 
68040 
68100 
68160 
68220 
68280 
68340 



gccgctcgtc ccacggccgt cgccgtctcg gacgatcggg gtcggcacac ctaccgcgag 68400 

ctcgacgagt ggtccggccg gctggcccgg gggctgcgga aggccggcgt gcgcgacggc 6 8460 

gacgcggtcg gcgtctgcct cgaccgctcg gccgagctcg tcgccgtgct cctcgcggtc 68520 

ctcaaggccg gcgccgccta tgtgccgctc gacgcggcgt acccggccga ccgcatcgcc 68580 

tacaccgtcg gcgacgccgg cctcgcggtc gtggtcacca cctcggcgga ctttcccgac 68640 

gtggacggtg tgcggctgct cgcgccggag agcctcgccg aggccggcga cgacccgggc 68700 

atcccgctcg ccaccccggc cggcccggag cggccggcct atgtcatcta cacgtccggt 68760 

tccacgggcc ggcccaaggg cgtggtcgtc ccgcacgcca acgtgtccgc gctgctcgac 68820 

gccacgcgcg aggagtacgc gctcggcccc ggcgacgtgt ggaccttctt ccactcggcc 68880 

gccttcgact tctccgtctg ggagatctgg ggctgcctgc tcaccggcgg ccacctcgtc 68940 

gtcgtcccgt actgggtgtc ccgctcgccg gagcagttcc acgacctgct cgccgagcgc 69000 

ggcgtcaccg tgctcaacca gacgccgtcc agcttcacgc agctcgtggc cgccgaccgc 69 060 

ggggcggagc gcgacctcgc cgtacgcctg gtgattttcg gtggtgagcc gctcgacgcc 69120 

cggacggtgc tgccctggct ggaccgccgt cccgaggcgc gctgccggct ggtcaacatg 69180 

ttcggcatca ccgagaccac cgtgcacgtc acggcggtcg acgtcacgcg cgcggccgcg 69240 

ctcgccggct cccggtcggt cggccgcccg ctgcccggct gggccgtgcg cgtgctcgac 69300 

gagcagcgcc gcgaggtgcc gccgggcgtg ccgggcgaga tctacgtggg cggcgccggc 69360 

gtggcgatcg gctacctcaa ccgcccggag ctgaccgccg agcggttcgt caccggcccg 6942 0 

gacggccggc gctggtaccg ctccggcgac cgcggccggc tgctgcccga cggcaccctg 69480 

gaacacctgg gccggctcga cgaccaggtc aagctgcgcg gcttccggat cgagctggac 69540 

gagatccggg gcgtgctcac cgagtgcgcc ggggtggcgg cggccgcggt cgtcatccgg 69600 

cgctccactc cggacgatcc ggcgaccgcg cggctcgacg cgtacgtggt cgccgaggcc 69660 

ggcgccacgc cgccggtggc cgagcacgcg gcccggatgc tgccggccta catgtgcccg 69720 

gcgaccttca cgttcctgga cgcgctgccg atgacgccga acggcaaggt ggacaaggcc 69780 

gccctgcccg agcccgcgcg cccggccgcc gacgctgcgg cgacgccggc cggcccgggt 69840 

gaggacgggc tcgcgggcga cctggccgac gtgtggcagc aggtcttcgg ctgcccggtg 69900 

accgtctcgg acaacttctt cgacctcggc ggcaactcgc tgctcgccgt gcggatggcg 69 9 60 

gcgctgatgc gccgccgcgg cctgccccgg ctgcatccgc gcaccctcta cctgcacccc 70020 

accgtgcgcg gcctcgcgga cgcgttgcgc tcggcctgac agtcccctcg ccccctcgag 70080 

aagtacggga gaagcaccat gcgaaacctg cgtcggacca ccggcatcgg actgctcgcg 70140 



ctgctgagcg tggcggcgtg cagctcgacc cccgcggcga gcgagccccc gccgtccgcg 70200 

gcgccgccct cggccgtgac ggccaccggc ccggcggccg agaaggccgt caagtcgggc 70260 

acccagacct atcaccaggc gctcgacgcc ttcgtcgcgg cgagcaacaa gggcacgacc 70320 

gacaccaccg agatcggcaa gtacgcgtcc ggccgggcgc tgatgacctt ccagggcatc 7 0380 

ctcgcctcct accagcagca gggcgtgcac accagcggcg agccgcgcat cgacgagccg 70440 

gtcgtcaccg ggctcacccc gccggccgac cccaccggcg tccagctgcg cggctgcatc 70500 

gacatcagcg cctggccgct gacgaaggcc gacgggaccc cggccgacaa ggtgggcggg 70560 

cagcagggca gcgggcccag cgcgatcctg gcgaacgtcg cccgctcggg tgccacctgg 10620 

caggtgaccg agctggccat ccagggaccc tgcgcggcgt gaccgtccgg cgatggctac 70680 

cggccgggct cacggtcctg gcgttcgccg ccggcttctg gcagaagctg ccctgccagg 70740 

ccgctggctg gccggacgac accgcgacgc tgttcggccg ctactgctac agcgacgtgc 70800 

cgattctctt ccgggagcgc ggccttttcg acggcatttt cccgtacgag tccgggccgg 70 860 

gcgcccagcc gctggagtac ccggtcctca ccggctacct gatggacgcc acggcccggc 70920 

tcgttcgcgc gatcctgccc ggcgcggacg tggccgtcgc ctcccgggcg tacttcctca 70980 

cgacggtcct ggtgctgctc gccctcgcgg tcctgaccgt gtgggcgacc ggtgcggtgc 71040 

tgcgccgcac cggcgggcgg ccgggcgacg cgctgctggt cgccgccgca ccggtgctga 71100 

tcctggccgg cacggtgaac tgggacctgc tcgcggtcgc ggcggcggtg ctcgcgatcc 71160 

tcgcctggga acgggaccgc ccgctgctgg ccggcgtgct gatcgggctg ggcacggcgg 7122 0 

ccaagctgtt cccgctggtg ctgctcggcc cggtgctgct gctctgcctc cggcagcggc 71280 

ggatgcggcg gttcgcccgc gtggccgccg gtgccgccgg ggcctggctt ctggtcaacc 71340 

tgccggtggt cgcgctgcaa ccggacggct ggatggagtt ctggcggttc aacgccgggc 71400 

gcggggccga gttcgggtcg ctctggttcg cgctggacgg gctcggcctg cacatgccgg 71460 

cggtgaacgc cgtcgccctg gcgacgttcg gcgtgctgct ggccgggatc gcggtgctgg 71520 

ctctgcggtc gcgccggccg ccggacctgg cgcaactcgc ctgcctggcc gtcggcgcgt 71580 

tcctgctgac caacaaggtc tactcgccgc agtacgcgct ctggctcctg ccgctcgtgg 71640 

tgatcgcccg tgggcgggtc ccgcggtggc cggtggtgcg cgactgggcc gtctggcagg 717 00 

ccgccgaggt gctctactgg ctcgcggtgt ggagctggct cgccggttcg ctgaccgacg 71760 

^^cggcagta cgcctgggca accgtcctgc gcgtgctcgc cacggcgtac gtctgtggtc 71820 

aggtggtgtg ggacgtgctc gccgcccctc gcccgcaccg gccggcgccg cccccggcgg 71880 

tcgccgagcc ggcccacccg ggctgacccg ggcaacgaga gatcccctcg ccgccggcgg 71940 

cgaggggatc tctgttcgtg tgctcagtcg gcggcgaccc acgtgaggcc gttgccggaa 72 000 
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ggggtcagcc 
ccggcctgga 
aagccggtgc 
gagtgccgcc 
ccgaccaact 
gcccagaagt 
aagacgccga 
accgtgccgg 
aggtgcacgc 
atgcccggcc 
ccgtagtcgc 
tcggcgatca 
cggcggacca 
ccggagccga 
ttctcgaagt 
ccgaccgtcg 
gcggtgcggg 
atcgcctcgg 
agcaggatct 
atccggggcg 
gtgccgccgt 
ttgcagacac 
ttctcccgct 
gaaccgcccg 
aatcctcctc 
tgtccgccca 
ggtcagctcc 
cgtacgccca 
aagagatcgg 
agctcgcggt 



cgccctcgaa 
tctgcgagcc 
cgccctcctc 
gcaccagctc 
cggtgaacga 
acgagctctc 
actcgcgccg 
ccagcacgct 
cggaggagaa 
gccagtacga 
cctcggtgac 
gctcgaacag 
ccgcgccgac 
aggcgacgca 
agccgaagag 
tgtgcagggt 
tctgcccgtc 
tcaccgggtg 
ggtcgaaccg 
aggtggcgaa 
gcttgcgcat 
cgtgcacggt 
ccagcagcag 
gcccgccgcc 
agcgaccatc 
cccgcggcgt 
cctcgaccag 
atttggactg 
ccgcgatctc 
cggagagggg 



cagcggctgc 
ttcgaacatg 
gaggccggcc 
ggtggagccg 
ctccatggcc 
atcgacgtgc 
gtagcgctgc 
gttgatggag 
caccgggtcg 
ggtgctggag 
ccgggtggcg 
cgccttctcc 
gctggtcagc 
gaggatgttg 
cgcgaggttg 
gctgcggttg 
ctgggtgaaa 
gttctcccgg 
gcggcgctcc 
ggtgaacgtc 
gaacccggcc 
ggacggcagc 
caccttggcg 
gaccacgatg 
tgagctgcat 
tcgaccgcgg 
ccggttctcc 
gatcgtggag 
ctggttcgtg 
aatgggctcg 



tcctgcgtgc 
gcctgcgcca 
accgcctggc 
gtcagcgcgt 
ggtgcgctgc 
atgtcgtaga 
tcgaactcgg 
cgggccgcga 
atgaagcagg 
taggagtagt 
tcaccgagga 
tggtcgccct 
gtctcgctga 
ccggagttcg 
cggaagaacg 
cccgaggcgt 
cggacgccgc 
acgtcgacgc 
acctggaacg 
cacggctcgg 
ttggcgatct 
agcgattcgc 
ccctgcttgg 
acatcgaact 
gatggttgcc 
atgtcgccgg 
caggcccagg 
gcgtgcccct 
cggccgcggg 
gagccggccg 



cctccgggcc 
cgacgctcga 
ccagctccgc 
cctcgccgga 
tctcggtcac 
agctcaccag 
tgaaggcgcg 
gcaggccgct 
cggcgtcgcc 
ccttgcggac 
agtcggcgat 
gcaccttgtg 
gcgggatgta 
gcgcgggcag 
gcgagtattc 
cgacgacgaa 
ggacccgctc 
ccagccggcg 
cgtgcgaggt 
tactggtgcc 
cgtcgccgac 
cgatctggta 
cggtcagcgc 
cttccggttg 
gcgggccgct 
tcgcacgctc 
ccgccacgcc 
tcaccgtgct 
cgatcgcccg 
gcgccgccga 



caggatcgcg 
cccgcgcagg 
ggtctgccgg 
ggcgatgccg 
cttgcgggcg 
gaagtcgtgg 
gtcctcgtcc 
gtaggtcgcc 
gaccaggcac 
ccggatctgg 
catcgggcac 
tgccatctcc 
ccagaaccag 
ccgccggccg 
gcgggtgccg 
gcgcgccagc 
gtcgtcggcg 
cgcgttctcc 
cggcccggcc 
ccacttgaac 
cccgagcagg 
ccgcgggaac 
cgcggccgtg 
agcagccacg 
gtgcgccacg 
gcgcgggccg 
gacccggttg 
cagcgagatg 
cgccacctcg 
ggcgttcagg 



72060 
72120 
72180 
72240 
72300 
72360 
72420 
72480 
72540 
72600 
72660 
72720 
72780 
72840 
72900 
72960 
73020 
73080 
73140 
73200 
73260 
73320 
73380 
73440 
73500 
73560 
73620 
73680 
73740 
73800 



tggttcagca 
cggacggcct 
ccgcgtagcg 
agaggattga 
ggcatgcgga 
tccacgccgt 
cgcaggccac 
gctccctcgg 
ccgtgctcag 
cgccgcggtg 
tctcgtcggc 
gggcgacgtt 
gccactgctt 
gctcgacaag 
cgcgcagcag 
tgaccgccag 
gggtctgcag 
gttcgaggcg 
tccgcagcga 
ccggcgccgg 
gggcggcgcc 
tgaggacggc 
ccgccgtgac 
cgcgccgccg 
agtaggccag 
agaggacggc 
atccaagggt 
tcatgccgcc 
gcccgaaatc 
gccatgtctc 
ggccggcgga 



gccgcacggt 
ccaccagcag 
cgccgtagac 
ccacgccggg 
tgtccaccag 
cgacggcctc 
tgcggatcat 
tgggttgggc 
cgtgccgccg 
gtgcaaccgc 
ctgctccacg 
ggtgagcgcc 
catgccgtcg 
cgtgctcagc 
gcccacgacc 
ccgctcgggc 
gatcatgccg 
ttcctcctgg 
caacccggtg 
gccggcgtcc 
gcccacggcg 
cagggccagc 
gacggcgccc 
gatcagggcg 
gccccagctc 
cagccccgcg 
acggatgccg 
caaggatagg 
aacccttctt 
acgagcgctc 
gggccctgac 



gaccgacggc 
tgtcgggccg 
gtattcgtcc 
gccggccagc 
gcacacgtcc 
ggccaccacc 
cgcctgatcg 
agggtggccg 
aggctctcca 
ggtggggccg 
gtcacggtca 
tcgcggacga 
ggagtggtga 
tcctcgggct 
cggcgcatgg 
acgcgcccgg 
gtgatgtggt 
cggatgcgct 
gccgtgccgc 
caggtcgcgg 
aaggcgggga 
gccatggccg 
gccgcgatcg 
agcaggcaga 
tgcgcgacgg 
tcggccacca 
gcgccgtcgc 
ggcgaatcgg 
agggcggtac 
cactcccgtt 
cgactgcacg 



gagaccagcg 
gcgtccttga 
gagtcgaacg 
gagcgggtca 
ggccgcagct 
tcgatgtcgg 
tcggcgatca 
cgaccgacca 
cccgctctcg 
ccgccgcgtc 
cggtcacgtt 
tccggtagac 
gccgcaccgg 
ccggtgccga 
ccgcgagggc 
cgtcgcgccg 
gcgcgaccac 
cggcgttcgc 
ccagccaggt 
cggccacctg 
tcgtgggtgc 
ggccgggctc 
tgagcccggc 
cgagcgtcgc 
cgacggcctc 
tggtggtcgt 
cggcggcggg 
gcgccgccaa 
cggtccgcgg 
ctgcaggccg 
ctctccgttc 



cgtcgccctt 
ggatgaagcc 
tggtgaccac 
cctcgatgcc 
tgcgcgcctg 
gctcgtcctc 
ggacacggat 
gccgccgccg 
catgccgacc 
gttggtgacc 
gggtgcgtgc 
cgtggtggtc 
gccgccctgc 
cggtgggccg 
ctcggtgccg 
cgcgagcacc 
gtcgtgcagc 
9"cgggcccgg 
gaggatgttc 
gctgagggcg 
ggcccgcacc 
ggccggcatg 
cgcggcgcac 
ggccgcgccg 
gacgacgacg 
cctgtccgcc 
accggttgcg 
ggggtcaaaa 
ggcgctcccc 
agggcctgac 
cctccggacg 



gtgcgccgcc 
gaccgccccg 
gatgacccgc 
gtcgatgcgg 
tgcgaccgcg 
gaggatcagg 
cgtcatcgtc 
ggacgggggc 
agaccgtacc 
tcgaccctga 
ggcgcgtgcc 
acctcgatcg 
cgggagaagc 
tcgtcggcgt 
gccgtttcga 
tgggcggcct 
tcccgggcca 
ccgtccaccg 
agccaggtca 
accacgccga 
gccgagccgg 
tgcagcacgg 
ggcaccgggc 
ccgatcaacc 
gcggccacga 
gtccgcacgg 
gccgcgatgt 
gtagggtgac 
gacgatgaag 
gaaacgctac 
ggtgatcgcg 



73860 
73920 
73980 
74040 
74100 
74160 
74220 
74280 
74340 
74400 
74460 
74520 
74580 
74640 
74700 
74760 
74820 
74880 
74940 
75000 
75060 
75120 
75180 
75240 
75300 
75360 
75420 
75480 
75540 
75600 
75660 



ctggtcggac 
gcgccgagcc 
ctggcgcggg 
gaccGcctca 
cgcatcgcct 
cgggccgagc 
gagcccggcg 
ttcgtcgccg 
cggctctgcg 
cgggacctgc 
ccgccgggga 
cgggccgagg 
ctggtgctca 
gcgcccggtc 
cggtcggcgg 
acggcctcgc 
tggagccggc 
taccggtcgg 
agctcccggc 
ccgaccgggc 
atgccgctga 
ttgtccggcg 
agcagcatca 
cggaccagct 
accgagcgca 
ggcgcgtaac 
aagacgtcga 
aggtcggcga 
accgcgctgt 
gcccgcagcg 



cgcgcggctc 
ggggccggat 
tgggatacgt 
cgatgggcgc 
cggccggcat 
tggcgttgac 
cggtgctgga 
agatcgacgc 
accacctgat 
tcgcccggca 
tggagcccat 
tggacctgcc 
gctatctgag 
agccggggag 
ccatctcgaa 
cgccgagccg 
caccggcgcg 
ccagcgcgaa 
a-ggcgtcccg 
cctcgtcgtg 
ccagccggga 
gcggctcctc 
ccaggcaggc 
tctggaacat 
gatcgtcgat 
cggccgtggc 
cggtgagcag 
cggcggcccc 
ccagcgggcc 
ccacctcgat 



gggcaagtcc 
ccgggttctg 
accgcgggag 
gcggctcaat 
tccgcgtacc 
cctggccggc 
tgcgccggcc 
gagcgtgctg 
cgtgctgtcc 
ccaccgcatc 
ctgggtggag 
ccggcggccg 
ccgggcctcg 
ctgacgggcg 
gagctggtgc 
gccggccagc 
ccacaccccg 
ggcggccggg 
caggccggtc 
ccacgagtcg 
gaagtccaac 
ggccgtccac 
cgcgcgggac 
cgcgtacggg 
cgcgtccagc 
ctccggcagc 
ggcagcatag 
gtacagccgg 
caccagcagg 
gccgtgtccc 



acactgctgc 
ggggagcgcc 
ccggcggtgt 
ccgcggtggg 
<^ggcgcgcgg 
ggcaagcgcc 
cgcgcctcgt 
atctccggtc 
gactcccggg 
atcgcgccgc 
gacttcggct 
tggacggtgg 
ggcgcgcccg 
ctcaggtcca 
tccacgtgct 
ccgtccagga 
agggcgcagg 
ttggcgtcga 
agctcccggt 
tgcaggacgc 
ggggacagct 
gaccggcggg 
acgtgcgcga 
ccctgccgca 
acgatctgcg 
aggtgcagcg 
gcgcgcgcga 
cgctccatcg 
gccggcgaca 
tgttcgccca 



agctgtgctg 
cggacgcggg 
acggctcgtt 
accggcggct 
accggctctc 
cggagctgct 
tcctgcgcgg 
acccgtccgg 
tgctcgtcgc 
gcggcgagct 
cgtacagcgg 
agcgggtcga 
cgctcgccgg 
gcagccggct 
cgcgctcggc 
cggccagcag 
ccgcggccag 
tcgtgatgtc 
ggaaccgctc 
cggccagcgg 
cgccgccgag 
ccagccgggg 
aggaggcggg 
gataaccctg 
cggtcaggta 
cccgcaggcc 
tggtggtgcg 
ccatctcggt 
ccatccgggt 
acagcgcgga 



cgggatggtc 
cgcggcgcac 
cacggtggaa 
ggccgaccgg 
cgccggccag 
cgtgctcgac 
cgtgctcgac 
agaggtggag 
cggcgacgtc 
ggaccgcctg 
gggagtggtg 
gctcgaggag 
ctgcctgatc 
ggtgcgtacc 
cacgctcagc 
cgccggccgg 
cagcacggtg 
ggccggcccc 
ggcgaaccgg 
gtcgccacgc 
ggtgaacacg 
caggtgcggc 
cgcggcgtcg 
cgcgcccagc 
cttcacggcg 
cacgcccgag 
cacgtacggc 
ggccagccgc 
gacctgatag 
ggacggcacc 



75720 
75780 
75840 
75900 
75960 
76020 
76080 
76140 
76200 
76260 
76320 
76380 
76440 
76500 
76560 
76620 
76680 
76740 
76800 
76860 
76920 
76980 
77040 
77100 
77160 
77220 
77280 
77340 
77400 
77460 



gggcagtcgt 
cgcgggcggt 
tgcgaccggc 
cgcaggttcg 
gccgcgcact 
atgcgcttgt 
caccacacgt 
tcgcgccgcc 
ccgtacgcca 
tcggccagca 
ccggggtcgc 
ccgcccggac 
cggtgcgtgc 
ggccgggatc 
gcgcggccag 
ccgaggtcca 
gctcgtggcc 
tccgccgcgc 
ggacggcggc 
agagcagggt 
cggccagccc 
ggtgcgccag 
ggtcgcgcac 
agccgagcag 
tgacgatcgc 
cgacggcggt 
ggtggccccg 
tcagccgcag 
gctgcagcga 
cgggcaggcc 
tggccacctt 



cgaagaccag 
ccaccgcccg 
tgccgcgcgc 
tcacgatctc 
cggcggcgga 
tggccagcag 
tcaccgaggc 
agaccgcgcg 
ccggcacgaa 
tctcctgccg 
gggcgtcgcc 
ggtgccggcg 
ggccgccgcg 
gggcgccggc 
cacccacagg 
cagcagcagg 
caccatggac 
ctcgccggcc 
cgacacggcg 
cagggcgccg 
ctcggcggag 
cgccgtgcgg 
cagcttgggg 
ttcggccagc 
cgacgtcacg 
cgcctcggcc 
gtcgagcagc 
cgcgcggtcg 
gcgcagcacc 
ggcgtcctcg 
gggccggggc 



cccgcccagg 
cgcggccggc 
ctcgccggtg 
cttgtgcccg 
gaagtcggtg 
caggccggcc 
gatgagcgag 
cagcacctcg 
ctcggcgttg 
ctcgtcggcg 
gagcagcttc 
ggcggccgca 
acccgcgcgg 
gcccgcagca 
ccgtccgccc 
cacgccgcgc 
ggccgggccg 
agcgcggccg 
agcgccggca 
cggtccagcg 
acggcgccgg 
acgaccgggg 
aacatgccgt 
tcacgcagca 
ctcatctcgg 
gcggcgagca 
ggccgccgca 
ccggcgccca 
gtctccagcc 
aacgcgatgc 
agccacgtgc 



tcgatgccgc 
agctcgtcgc 
cgggcgaaca 
gacagcaccc 
ccgtgcgcga 
gcccggcgcc 
ctgaagccgt 
gccagccggt 
agctgatagc 
gcgagcacgg 
tcgagcgccg 
gggccgtgcc 
ccagcagccg 
cccgggccag 
agagcgggtc 
ccgcgtagca 
agggcccgag 
cgtgccgggc 
gcgcctgcac 
gcggtggcgg 
ccgcgaagcc 
tgctgccgtc 
gctcgtactc 
cgcggtccac 
ccgtgaggct 
gggcggcgca 
tgatcatccg 
gcgacagccc 
cgcttccggc 
cggagatgtc 
cgggggcgag 



gcatcccgct 
gcaccagcag 
gcaccatcgc 
atccgccgcc 
gctcgtggaa 
gttgctcctc 
agcccagccc 
cggcgcgggc 
ggtcgagcag 
ccgcatagcc 
cggcggtcac 
gaacggggtc 
gtcggcgccg 
ctcccggcgc 
cgcggtgtgc 
ccactcgtag 
ctcggtcatc 
cagccccgcc 
caccgagcac 
ctgaccggcc 
ggccgcgagc 
gaacaccgcg 
gtcacgcagg 
caccgtgggc 
gtgcaccgag 
gcgcgccaga 
ctgggccacg 
ggcgcacatg 
ccgcccgagc 
gatgccccgc 
cgcctccttg 



gcccggatac 
gaactggctg 
ctcggcccgg 
ggccgggcgg 
cgcggccgcg 
gttgccggcg 
caggcacggg 
cagccggccg 
ccgctcgccc 
gaccgggttg 
cggccgtcct 
accggttcac 
tcgtcgtgcg 
acggcacgca 
cggccctccg 
gcggcggcca 
tgcccgcaga 
gggccacccc 
ccgtggcggg 
gcgctcgcct 
cgcgggaact 
accacgtggt 
aaggaccgcg 
acgtacgcct 
cgcgtgccga 
atcccggccg 
aagcgggcgg 
gtgcgggtca 
agggccgtgc 
accccgtgcg 
tcgaccagga 



77520 
77580 
77640 
77700 
77760 
77820 
77880 
77940 
78000 
78060 
78120 
78180 
78240 
78300 
78360 
78420 
78480 
78540 
78600 
78660 
78720 
78780 
78840 
78900 
78960 
79020 
79080 
79140 
79200 
79260 
79320 



acaggctctg 
gggcccgggt 
ggggcagggc 
cccatgcgac 
caccggccag 
tcaggtcgcg 
caccgcccccL 
gcagccgggc 
cctcggggcc 
tcatcggccc 
cgtgatgccg 
ggtccggcgc 
gacggtgagc 
cggatgcacc 
cccgatctgg 
ggccaggttg 
gcctccggtc 
cgcgggcctg 
cgcgccgggc 
gtccgggtcg 
ggccgcggcg 
gtcggcggga 
ctcggccatg 
ctcggtgaac 
gatgcgggac 
gtacgcgaag 
gtccgccggc 
caggcccgcg 
gaacacgtcg 
ggcccggtgg 



cccgcgggcg 
cgcgttgttg 
cgtggtcgtg 
cggggtgccg 
ccagaccggc 
ccgggcgacc 
ctccggcggt 
cgggaactcc 
^gagcaggtcg 
gcgcccgcct 
ccggccagga 
accgtgccgc 
cggcggctga 
tcctgcacca 
tggtccaccg 
cggccgtgca 
agcgcaccca 
gcgtcgaaga 
ccgcgcagcc 
acgatgagca 
gggtcggccg 
agaaagaccg 
ccgtagcagg 
gcccggacgg 
aggtcgaggc 
ttgggcgcgg 
cggcggacga 
gtgagcatgg 
tcgctgttgt 
ctgagcatca 



tcgccggccc 
atcggccact 
ccgtgcagca 
gcgagcaccg 
gccgacccga 
gcacgccaca 
acgtaccagg 
tcggcgatgt 
tccaggcgca 
gcaccggccg 
accgctcgcg 
gccgcaccag 
cggcgctcgc 
gcacgatccg 
cgtcgtgcgc 
cgatgagcag 
ggtcacccgt 
tgccggggtt 
agatctcgcc 
cctcgaagtc 
ggcgcagggc 
ccggcgcctc 
ggttcaccgc 
tggccgggtt 
cggcgatctg 
cggagatggt 
aggccgtcgg 
cgaacaggcc 
gcccgctgct 
cgcccttggg 



ggccggtgcg 
tgagaccgcg 
ggtcggcgcc 
cggcggcaag 
ggtaggtctt 
ggtgcaggag 
attgcagacc 
cgcgggcggc 
cctccgcggg 
cagcgccggc 
gaccgcggtc 
cagcacgttg 
cacccgcggc 
ctcgtcgggc 
ggcccgggcc 
ctccttgagg 
tcgcacccag 
gagctcggtc 
gacccggccc 
gggcacccgg 
cggcgacgcg 
gaagaccttc 
gtgggtgtgc 
gaccggctcc 
ctcgtcccgc 
gacccggtag 
cgacatcagc 
catgtcgtgg 
gcggtcgaac 
cgagccggtg 



ggccagcacg 
cagccggtag 
gtggtcgggc 
cgtcgcgcgc 
gccgtgcgcg 
ccgctcgtgg 
gaaggagttc 
cgtctcggcg 
tggcgccagc 
tcgagctcgg 
cggcggatct 
cgtaccggca 
agctcgtcga 
gccggcaccc 
tcccgctcga 
cgtccggtga 
ccgccgtcct 
ctgccccagt 
tcgggcagcg 
ccgacacccg 
gcgtcctcca 
gtcgacacgt 
aggccgaagg 
gatccgttgt 
acggcgcgca 
cggtccatca 
acgacgccgg 
tgcagcggca 
gcgcggaggt 
gagcccgagc 



gtgaggtatc 
ccggcgtcgt 
tcggagagcg 
tggccgtcgt 
acggccgcgc 
tcgccggggg 
agccggtcgt 
tcccatgcgt 
gggcgtacgg 
cgtgcagcgc 
tgccgctcgt 
cgccgaagga 
gcggcgtgcg 
cgaacgccgc 
ggtcgtgcgg 
ggaacagctc 
ggccgtcgcc 
agccggcgcc 
gccgtagccc 
gtatctcccg 
gcgcgcgagg 
acgcggtgaa 
gggcgaagcg 
agagcgtgcg 
cgcacaggtc 
tccgcagcca 
cgccgttcag 
gccagctgcc 
tggccagcac 
tgtactgcag 



79380 
79440 
79500 
79560 
79620 
79680 
79740 
79800 
79860 
79920 
79980 
80040 
80100 
80160 
80220 
80280 
80340 
80400 
80460 
80520 
80580 
80640 
80700 
80760 
80820 
80880 
80940 
81000 
81060 
81120 



gacggccagc 
cggaagcgca 
cgccgcgtcg 
ggtgcgctcg 
ggcgtagagg 
gatgcgctcc 
gtcgagctcg 
gagggcgggc 
ttgggtggcg 
gacgccgccg 
caggctcttg 
tccggcctcg 
tgagtgctgt 
attgggcaca 
ccggcggttg 
acgggctgtg 
gcccgcagga 
tggatgacct 
gagccgatct 
atctcgtcga 
tcggccggcg 
tgtgagcttg 
cgtccgggcg 
cccaaccagg 
aaacgccgga 
cgcccgcgcc 
gggcccgccc 
tcgcccgcag 
ggccggcgtt 
ccggcgcctt 
tggccctgcc 



gagtccggcc 
ccgaccggca 
tcggcgttga 
gaggcgccgg 
cacgccaggt 
ccggccggga 
gcgtacgtga 
tcgccggggc 
gcgtcgatga 
gcgggtcgcg 
ggccgcatgt 
ccgaacaccg 
ccctcgtcgt 
gcgtcgcctc 
gcgccaccgc 
cgactgcgcc 
gcatgagcgc 
cctgctcccg 
tctgcgagat 
tgcggttccg 
gtgcgcccgg 
tcggtggtga 
cgtgcgactg 
ccgaacggct 
ggatgccttg 
gcggcgtcgc 
ggcgctgctg 
cggctacccc 
ctggaccggc 
cgtcccgcag 
gcaggccgtc 



ggggcccgcg 
ggccggacag 
cgacggccag 
acgaccctcc 
agagctcgac 
accgggcggc 
gggtgacggg 
gcgcgatcgc 
ccatggcaac 
ctcagctctg 
cggtccagga 
tcgtccagcc 
tgaccaggac 
catcgaacgg 
cagcactatg 
gccggcgcgg 
gatggtcgtg 
ggagagcacc 
cgaggccctt 
caggaccggg 
cgccacgacc 
ctcggggtgg 
aggccgtccg 
cctacagtcg 
gccgccgtgg 
cggtggcggt 
ggcatcgcgg 
atgtactacg 
gcgttcgacc 
gcgctctccg 
gagggggtca 



gggcgggcgg 
gccgcgctcg 
ggcgggggag 
gggcgggggc 
gaactcggcg 
gagccaggcc 
ctcggcgtcc 
ctgcgcccgg 
tgcctctccg 
cggttgcttc 
cttctcgatg 
gtcgggcacg 
caggtagttg 
atcggatgac 
gccgagactc 
ttggacagcg 
ccatcctcgc 
acccgggttc 
tcctgccaca 
atgtccgcct 
tgaatgacgt 
gcagccggga 
tcagtcgaac 
ccgtcgatcg 
atgtcccgag 
tctggcagtc 
cgctggcggc 
ccgtggcggt 
cggccgcctc 
cccgcgtctt 
tcgccgtgct 



atcgcttccg 
cgcaggacgg 
cagtcggccg 
gccggcacgg 
ccggtgggca 
gcccgcgccg 
gggtcgccgg 
aaggcgtcgg 
gaagggtccg 
tgcttctcca 
tagtcgaggc 
tcggcgaaga 
gcgtcgggat 
ggtggtcagg 
ggaacggccg 
gtcagagcgg 
cgagcgcggc 
cgccggaggc 
gccgcatgat 
ccggcacggc 
tctccaccat 
cgttaacacc 
gcacgctttc 
cagtcggtgt 
ggtgcgcccg 
gccggacggc 
cgtgctgtac 
gaagagcatg 
gatcacgatc 
cggcttccac 
ggtcctctac 



ccgcgagggg 
cggtgagcgg 
cgatgccgac 
cgaccaggcc 
gggcgatgag 
cggcgcggcg 
ggagcaccac 
gaacggtggg 
ccagggccgc 
tcgccgcgat 
agtcctgccg 
ccggccagag 
cttcaaacgg 
agccgcacgg 
acaagcgtgc 
gccgcggccg 
gcggaagcgc 
cagccggatg 
ggcggcgtcg 
ctcgccgtgc 
cgccgtctcc 
gccgcgcggc 
gtgcgcactg 
a.acaccgtcg 
cccggtgccg 
cagccggcgt 
acggcgaacc 
tcggtgagct 
gacaagctcg 
cagtggtccc 
cgggcggtgc 



81180 
81240 
81300 
81360 
81420 
81480 
81540 
81500 
81660 
81720 
81780 
81840 
81900 
81960 
82020 
82080 
82140 
82200 
82260 
82320 
82380 
82440 
82500 
82560 
82620 
82680 
82740 
82800 
82860 
82920 
82980 



ggcgctggca 
tcgtgtcctc 
tcgcggccga 
cgggcgcctg 
cggccctggt 
atgtcgcggc 
ccctgctgcc 
ccatggtgtt 
cgaccggctt 
atcgcctcgc 
gcctggcccg 
cggccgcggt 
ccccgccgct 
acggcacggc 
ggctggccac 
ccgtcctggc 
tggtggtcgc 
gatacgccgg 
tcgacgacga 
cggcccaccg 
ggctcatcgt 
cgttcccgtc 
tcctcggcgg 
gggtcctcga 
cggtcctgcg 
gtggacaacg 
gtcgaatccg 
ggcgacgcgc 
ctgaccgcgg 
ggtgtgtcgg 



cgggcccggc 
gatgttcggc 
cgcgttcggc 
gatcgggctc 
cgtcacctat 
ggcggtggcg 
cggctcgcac 
cggctacaac 
caccgacggc 
cacccagatc 
gtggcgcacc 
ggtgctcagc 
ggcggcgctc 
gccctggctg 
ccgctatccc 
cgtggtggtg 
cggcctgctg 
cacgtcgttc 
caccaccgac 
cgacggccgc 
cccgaccggg 
gctggccggc 
cccggagggc 
gacgtgcgcc 
atgcgacaag 
gcaccttcac 
ccagcgcccc 
acgcgccggt 
cgcccggcgg 
ccatcgcgct 



gccgggctgg 
cactccatgg 
gcggcggtga 
ggcttccagg 
ctcgccggcg 
gcgaccctgg 
cggccgtggg 
ggtttcgacc 
ggggccgcgg 
gggtggtggt 
gcgcgcgccg 
cggatcacca 
gcggcggccg 
ctgcccgcgg 
tcgttcctgg 
ctggccgcgc 
gcggtgccgg 
gaggccggtg 
cggctgacgc 
acctacctgg 
cactcctacc 
gtgcagcgcc 
ctcggcggcg 
accgtgccgc 
ccctgacaac 
cgatctgcgc 
gtttacggag 
gcggcgggtg 
ggaccatccg 
cagcacgaga 



ccgcggccgg 
aggacggtgc 
cccgcggcag 
cgaagatgat 
caccggtgcg 
cggtctcgct 
cggacggcac 
gggccggcat 
ccggcggttc 
acccgctggc 
gcctcctgtt 
ttcagcacaa 
gcgcggtgca 
tcgtggtcgt 
ccgggctgac 
ggccgacggc 
tcgcgtgggg 
cggggccgag 
cgggcctgcg 
cggccacgtc 
tgccgctcgg 
tggtccgcga 
aggccaccga 
cggccgagca 
cgacgtagca 
atcgaccaca 
ggctacggct 
gcgctgggac 
gccatggcct 
gacgcccacg 



gctgttcgcc 
gctgacgctc 
cccggcccgg 
gcaggcgtgg 
ggcgcgggcc 
gctctgggtg 
cacctccggc 
ccacgtgccg 
ctggacggcg 
gctgaccggc 
ctggggactg 
cgcctacctg 
gctctggcgc 
ccaggccggc 
gtggacggcg 
ccggcggccg 
cgcctcggtg 
cgggccggtg 
caggctcgac 
ctcgtggcgc 
cggcttcagc 
cggcgagctg 
ggcgtaccgc 
cggcgccgat 
cgacgaaccc 
tcgaattcgc 
tctcggtgta 
gcgacgacat 
acgtcgagca 
cggcgttcsic 



acgacgccga. 
tgcctggtgc 
ctggcgctcg 
ctggtgctgc 
cgggtcgtcc 
ctggcgctga 
aacgccttcg 
ggcgcgctga 
cttgccgcgg 
ctgctgctcg 
tggttgctga 
gccgtgctgg 
acgcaccgcg 
tggaccctgt 
ccgatcgccg 
gccgtcgtcg 
ctgaacccgc 
ggcgtgcggc 
gactatctcg 
acggccggcc 
ggagcggcgc 
cgctacttcg 
atcaccggct 
ccggatctca 
crgo^aggaaca 
ggtcgcggac 
cggcgggacc 
ccggctcgtg 
gcacggcgac 
cgaggcggtg 



83040 
83100 
83160 
83220 
83280 
83340 
83400 
83460 
83520 
83580 
83640 
83700 
83760 
83820 
83880 
83940 
84000 
84060 
84120 
84180 
84240 
84300 
84360 
84420 
84480 
84540 
84600 
84660 
84720 
84780 



cggcgaggcg 
atccgcggct 
cggaccctgc 
caggcgatcg 
ttctaccgcg 
caggcgatgg 
gagcccgaca 
ccgggcgtgc 
tccgagcacg 
cgcatcccgc 
caggaccacg 
atcttcatgg 
gcgctgtacg 
gtcgccggca 
gctcgcggcc 
cggggagatc 
cccgacggcc 
cggcctggcc 
gcaccgcggc 
catcgcgctg 
cttcaccgag 
ggcccggatc 
cagcgccgcg 
cacggcgagc 
ggtgctggtg 
gtatcttctg 
cggcctgcac 
ggcggcgctg 
ggtgttcttc 
cctcgaactc 
gccggcggtg 



ccgtcggggt 
tcggtgacgt 
ccggcctgga 
accacatcgc 
acgtcctcga 
actccaaggt 
cgtcgctcga 
agcacctcgc 
gcgtcgagtt 
aggccgggca 
acgggcagct 
aggtgatcga 
aggcggtcga 
acccacgcgg 
ggtgccctgc 
ctcgcgggca 
tggctcttcc 
ctgttcacgt 
accgccgccg 
gccacggtgc 
ttcgcgctct 
ctcgcggagc 
atcgacgacg 
gggccggtcc 
acggtcgtac 
gtggcggtgg 
gcgatcttcg 
cgtgagcggg 
atcgtcaccg 
gccgcgatca 
tcgctcggca 



atccgccccg 
cctgcacacc 
gctgcggcgg 
cgtctgcctc 
cttcgagatg 
ggtgcagagc 
gcagggccag 
gttcatcacc 
cctgcacacc 
cccgatccag 
gttccagatc 
gcgaatgggc 
gctcgacatg 
aactggtgat 
tcggccggtg 
tcgcgctcgg 
cggccgaggc 
tcctgatcgg 
cggtgtcgat 
tgcatccccg 
tcctgggcgt 
ggcgcctcac 
tggtggcctg 
agctcgtacg 
ggccgctgct 
tcgcggtcgt 
gcgcgttctg 
tgcggcagcc 
gcctcggcgt 
tcgtgatcgc 
tgtcgtggcg 



gtcaccggca 
ttcgtcgagc 
cccagcccca 
gagccgggga 
atcttcgagg 
cgc tcgggcg 
atcgacacct 
gacgacgtgc 
ccggactcgt 
gcgctgcgcg 
ttcacgaagt 
gctcgcagct 
tccaagcaga 
cgggaccgtc 
ggtgcggcgg 
cccgagcctg 
ccggccgtac 
ctgggagttc 
cggctccatc 
gcacgacacg 
ggccatgtcg 
cggcacgcgg 
gtgcctgctg 
catcctcgcc 
ggtcttgctg 
gctgctctcg 
cgccggcctg 
gctggaacac 
cgacatcggc 
ctgcgccggc 
ggacgccaga 



acggcgtgac 
gggcgccggg 
cccggttcga 
ccctcgaccc 
agcgcatcct 
gtgtgacgct 
tcctgaagaa 
tgcgctcggt 
actacggccg 
acctgaacgt 
cggtgcaccc 
tcggcagcgg 
gcgcctgagc 
ctgctcgaca 
ctgcgccagc 
ctcggcctgc 
ctgtccgcgg 
aacccggcga 
gcggtctcgt 
accgggggcg 
atcaccgcgt 
gtgggcagca 
gccctggtga 
ctgctggccg 
gcgcggcggc 
gcatatgcga 
gtcatgcccc 
gtaagcgtgg 
gcgctcaccg 
aagctggccg 
accctcggcc 



cgtcgcgacc 
cgcggacccc 
ctcgggcctg 
gaccgtcgac 
ggtgggccgg 
caccctcatc 
ccacggcggg 
cggccggatg 
gctaccgggg 
gctcgtcgac 
ccgcgggacg 
caacatcaag 
cgccgatgga 
tcgcgctggt 
ccgcggtgat 
tgccgggcaa 
tggcccagat 
ctctggcccg 
tcggcctcgg 
ggaaggtcgg 
tcccggtgct 
tcgcgctggt 
cggccatcgc 
tcttcctggt 
cgtccgcgtc 
ccacctggat 
gggagccggc 
tgctgctgcc 
cggcgasicat 
gcgcgatcgt 
tgctcgtcaa 



84840 
84900 
84960 
85020 
85080 
85140 
85200 
85260 
85320 
85380 
85440 
85500 
85560 
85620 
85680 
85740 
85800 
85860 
85920 
85980 
86040 
86100 
86160 
86220 
86280 
86340 
86400 
86460 
86520 
86580 
86640 



cacgcgcggc ctgaccgagc tcgtcgtgct caacgtcggt ctgcagctcg ccgtgctgga 867 00 

cggccagatg ttcacgatga tggtgctgat ggcgctggtg acgaccgccc ttgcgggccc 867 60 

gctgatcgga tcggcccgga caccggcggc cggcgcaccg gctcaggcgc tcccggccga 85820 

accgcggacg cggcgggcgg cgtaggcccg gcgcatcttg atgtagttgc cgcatccggc 86880 

catcgagcac cagcgccgcg cctggttgcg ggaggcgtcg tagtaggccc accggcaggt 86940 

gtcccggtcg cagaccttca gccgggtcca cacctgcagc tcggcgcact gccggaccgc 87 000 

gtcgatcagc ccggccaggg cacggtcgaa cggcgtgccc ccggccggca gcaggcgcgg 87060 

cgcgccgccg gccaggctca gccgcacggg aacggcggcc aggatctcct ccaggcggcg 87120 

cagcgcggcc gggtcggcgg gatggccggc gtggcccagc agcacctggc gaaggccctc 87180 

acgaagcgcg acggcccgtg ccaggtcggc cgggcgcacc ctcgcgccgg gggcgagcag 87240 

gctttccccg gccagccacg cccgaagggc gtccggggtg ctcagcgact cgtcgtcgac 873 00 

ctgaggctca taggtgttga cgaaatcgcg caacagccgg gcaggggatg gcactgcggc 873 6 0 

ctcgtcactc accgttgacc tccttggtgg cgacagcgta gcgccttgac aggcgcgcga 8742 0 

accgcctgga acgatccgct ttcacaccag caaaactact ttggtggtgt gaagaggcgc 87480 

gaaggagcgg atcgtggcca ccacgctgag ggacgtggca cggctcgccc gggtgtcggt 87540 

gaagacggtc tccaacgtcg tcaacgacca cccgcacgtc agcgacgacg tgcgccgccg 87600 

ggtcgagacg gcgatccggc agctgggcta ccgtcccaac ctcgtcgccc gcgccctgcg 87 660 

cagcggccgc ggcagcggac tgctcgccct ggcgatgccg ggcgccggcg cgccgcagtc 87720 

gcccgccctg atcgaggaga tcatccggcg ggcggccccg ctcggattcc gggtcctcat 87780 

cgagccgctc gagtcgtcac ggccgaggcc gccggccccc ggcgtcgacg cccggctgct 87840 

gaacgcggag gcgccggccc ccgagctggt ggacgcccag gcggccacgg gcaccccgct 87 9 00 

ggtgctgctg accggcaccc ccgatccgcg atacgactgc gtcggcccgg acgccgcccg 87960 

cgcggccgag gacgcggtgg accacctgcg ccgtctcggc cggcgccgcg tcgccaccat 8802 0 

cggcggctcg ctctccaccg gtccggccgg ctccggctcc gacttcggtt ccggctccgg 88080 

ttccggctcc ggctccggtt ccggctccgg ctcgggctcc ggctcgggct cgggctccgg 88140 

ctcgggctcc ggcttcggct cgggctccgg cttcggctcg ggctccgcgg agggctaccg 88200 

ggccgcacgg cagttgttag gccacgaaga tcgcccggac gcgatcgtct gcggcagcgt 882 60 

gcggctggcg gtcggcgtga tccgggccgc cgccgacgcc ggcctgcggg tgcccgagga 88320 

cgtcgccgtg atcggcatcg gcgacggcga ggagggccgc tacacgcggc cggccctgac 88380 

cacggtcgcc accgacccgg cgttcatcgc cggcaaggcg c 88421 



<210> 2 
<211> 333 
<212> PRT 

<213> Actinoplanes sp. 
<400> 2 

Met Ser Trp Arg Gin Phe Arg Trp Gin Ala Leu Ala Gly Ala Val Ala 
15 10 15 

Leu Val Pro Leu Val Ala Tyr Leu lie Val Thr Ser Leu Asp lie Arg 
20 25 30 

Arg Ala His Asp Arg Tyr Gin Ala Gin Cys Ala Ser He Gly Asn Cys 
35 40 45 

Ala Glu Ala Met Leu Gin Phe Gin Asn Asp Phe Arg Thr Arg Leu Leu 
50 55 60 

Leu Leu Ala He Leu Leu Ala Ala He Pro Gly He Leu Gly Val Phe 
65 70 75 80 

Trp Gly Ala Pro Leu Val Ala Arg Glu Leu Glu Thr Gly Thr His Arg 
85 90 95 

Leu Val Trp Asn Gin Ser Val Thr Arg Arg Arg Trp Leu Ala Val Lys 
100 105 110 

Val Leu Phe Val Gly Val Ala Ala Met Ala Val Ala Thr Leu Val Ser 
115 120 125 

Thr Leu Leu Thr Trp Ala Ser Ser Pro Val Asp Ala Val Ser Gin Asp 
130 135 140 

Arg Phe Gly Ala Leu Val Phe Asp Ala Arg Asn He Val Pro Val Ala 
145 150 155 160 

Tyr Ala Ala Phe Ala Leu Val Leu Gly Thr Val He Gly Leu Leu Val 
155 170 175 

Arg Arg Thr He Pro Ala Met Ala Leu Thr Met Leu Val Phe Ala Val 
180 185 190 

Val Gin Phe Thr Val Pro Ala Leu Ala Arg Pro His Leu Met Ala Pro 
195 200 205 

Glu Thr Gin Thr Arg Gin Met Thr Leu Gin Glu Phe Gly Glu Val Arg 
210 215 220 

Gly Phe Gly Asp Glu Pro Thr Val Asn Gly Leu Ser He Arg Gly Ala 
225 230 235 240 

Trp Val Thr Ser Thr Ser Pro Leu Leu Thr Ala Asp Gly Thr Arg Leu 
245 250 255 

Asp Lys Ala Thr Tyr Arg Lys Cys Val Thr Asp Pro Pro Ala Val Ser 
260 265 270 



Gly Gly Ala Pro Gly Val Gly Gly Thr Val Ala Cys Leu Ala Asp Leu 
275 280 285 



Asp Leu His Val Glu Val Ala Tyr Gin Pro Asn Asp Arg Tyr Trp Thr 
290 295 300 



Phe Gin Trp He Glu Ser Ala Leu Tyr Leu Ala Leu Gly Gly Leu Leu 
305 310 315 320 

Leu Ala Val Gly Leu Trp Arg He Arg Arg His Val He 
325 330 

<210> 3 
<211> 304 
<212> PRT 

<213> Actinoplanes sp. 
<400> 3 

Met Pro His Glu Asp Ser Ser Pro Val Leu Gin Ala Glu Gly Leu Thr 
15 10 15 

Lys Arg Tyr Gly Arg Arg Thr Ala Leu Gin Asp Cys Asn Leu Thr He 
20 25 30 

Pro Arg Gly Arg Val He Gly Leu Val Gly Pro Asn Gly Ala Gly Lys 
35 40 45 

Ser Thr Leu Leu Gin Leu Ala Cys Gly Leu He Thr Pro Ser Glu Gly 
50 55 60 

Ser Leu Arg Val Leu Gly Glu Thr Pro Ala Ala Asn Ala Gly His Leu 
65 70 75 80 

Ala Lys Val Gly Phe Val Ala Gin Asp Thr Pro Val Tyr Ser Asn Phe 
85 90 95 

Thr Val Gly Asp His Leu Lys Met Gly Ala Lys Leu Asn Pro Thr Trp 
100 105 110 

Asp Gin Ala Leu Ala Glu Arg Arg Val Ala Gin Val Gly Leu Asn His 
115 120 125 

Gly Gin Lys Ala Gly Arg Leu Ser Gly Gly Gin Arg Ala Gin Leu Ala 
130 135 140 

Leu Thr Leu Ala Ala Ala Lys Arg Pro Glu Leu Leu Met Phe Asp Glu 
145 150 155 160 

Pro Ala Ala Ala Leu Asp Pro Leu Ala Arg Asp Gly Phe Leu Gin Asn 
165 170 175 

Leu Leu Glu Phe Val Thr Glu Leu Asp Ala Ser Ala He Leu Ser Ser 
180 185 190 

His Leu Leu Gly Asp Val Glu Arg Val Cys Asn Tyr Leu He Val Leu 
195 200 205 

Cys Ala Ser Arg Val Gin Val Ala Gly Asp Val Pro Asp Leu Leu Asn 
210 215 220 

Thr His Tyr Arg He Val Ala Pro Arg Gly Glu Leu Asp His Pro Pro 
225 230 235 240 



54 



Ala Gly Leu Glu Val lie Arg Ala Gin His Ala Asp Arg Tyr Thr Thr 
245 250 255 



Ala Val Val Arg Gly Asp Gly Ser Arg Pro Ser Thr Trp Thr lie Glu 
260 265 270 

Pro lie Gin Leu Glu Glu Leu Val Leu Ala Tyr Met Thr Arg Ala Met 
275 280 285 

Gly Val Thr Gly Glu Pro Leu Met Ala Ala Ser Gly Glu Val Val Arg 
290 295 300 

<210> 4 
<211> 336 
<212> PRT 

<213> Actinoplanes sp. 
<400> 4 

Met Ser Trp Arg Gin Phe Arg Gly Gin Ala Val Val Gly Val Val Val 
15 10 15 

Leu Ala Leu Leu Ala Ala Tyr Leu Val Tyr Leu Gly Val Asp lie Arg 
20 25 30 

Gly Ala Tyr Asp Asp Tyr Arg Ala Gin Cys Pro Ala Gly Gly Asp Cys 
35 40 45 

Ala Gly Pro Leu Gly Gin Phe Ser Leu Asp Tyr Glu Asn Thr Leu Leu 
50 55 60 

Tyr Leu Ala Gly Val Leu Ala Leu Val Pro Gly Leu Leu Gly Met Phe 
65 70 75 80 

Trp Gly Ala Pro Leu lie Thr Arg Glu Leu Glu Asn Gly Thr Gin Arg 
85 90 95 

Leu Val Trp Asn Gin Ser Val Thr Arg Arg Arg Trp Leu Leu lie Lys 
100 105 110 

Leu Leu Val Val Gly Leu Ala Cys Met Val Val Ala Gly Val Pro Ser 
115 ' 120 125 

Leu Leu Leu Thr Trp Ala Ala Ala Pro Val Asp Asn Val Ala Asp Asn 
130 135 140 

Arg Phe Ser Thr Val Met Phe Gly Ala Arg Phe Leu Pro Pro lie Ala 
145 150 155 160 

Tyr Ala Ala Phe Ala Phe Val Leu Gly Thr Leu lie Gly Leu Leu Val 
165 170 175 

Arg Arg Thr Val Pro Ala Met Ala Leu Thr Leu Val Ala Phe Val lie 
180 185 190 

Phe Gin Phe Leu Val Pro Asn Leu Val Arg Pro His Leu Met Pro Ala 
195 ' 200 205 



Lys His Leu Val Lys Pro Met Thr Val Ser Ala lie Asn Glu Ala Lys 
210 215 220 



Ser Leu Gly Ser lie Thr Gly Ala Pro Val Leu Asn Gly Leu Ser lie 
225 230 235 240 



Ser Gin Gly Trp He Thr Asp Val Ser Ala Leu Lys Thr Ala Asp Gly 
245 250 255 

Arg Ser Leu Asp Ala Lys Thr Phe Asp Asn Cys Tyr Met Asn Ala Pro 
260 265 270 

Lys Thr Gly Ala Thr Glu Gly Pro Tyr Gly Asp Val Ala Val Cys Leu 
275 280 285 

Ala Lys Leu Asp Leu His Val Asp He Ala Tyr Gin Pro Trp Asn Arg 
290 295 300 

Tyr Trp Ala Phe Gin Phe Leu Glu Ser Gly Phe Tyr Val Leu Leu Ser 
305 310 315 320 

Gly Leu Leu lie "Gly Ala Ala Val Trp Arg Val Gin Arg Arg Pro Ser 
325 330 335 

<210> 5 
<211> 283 
<212> PRT 

<213> Actinoplanes sp, 
<220> 

<221> misc_f eature 
<222> (1) , . (1) 

<223> V represents a non-standard initiator codon. It is expected that 
the bio synthesized protein will contain a methionineresidue at t 
his position 



<400> 5 

Val Arg Ser Ala Val Val Val Gly Thr Gly Leu He Gly Thr Ser Val 
15 10 15 

Gly Leu Ala Leu Thr Gin Arg Asp He Thr Val His Leu Leu Asp Ala 
20 25 30 

Asp Pro Ala Ala Ala Arg Ala Ala Ala Ala Leu Gly Ala Gly He Ala 
35 40 45 

Gly Glu Pro Arg Thr Arg Val Asp Val Ala Val He Ala Val Pro Pro 
50 55 60 

Ala Ala Val Ala Pro Val Leu Ala Asp Leu Gin Arg Arg Gly Thr Ala 
65 70 75 80 

Arg Val His Thr Asp Ala Ala Ser Val Lys Val Leu Pro Ser Arg Gin 
85 90 95 

He Glu Val Leu Gly Cys Asp Ala Ser Ser His Val Gly Gly His Pro 
100 105 110 



Leu Ala Gly Ser Glu Arg Ser Gly Pro His Ala Ala Arg Gly Ser Leu 
115 120 125 



Phe Glu Gly Arg Pro Trp Val Leu Ser Pro Gly Arg Arg Ser Ser Thr 
130 135 140 



Ala Ala Val Asp Gly Ala Leu Ala Val Val Ser Ala Cys Gly Ala Thr 
145 150 155 160 

Pro Val Leu Met Ser Ala Glu Glu His Asp Arg Ala Val Ala Leu Val 
165 170 175 

Ser His Val Pro His Leu Val Ala Gly Leu Leu Ala Ala Arg Met Leu 
180 185 190 

Asp Gly Thr Pro Ala Gin Leu Gly Leu Ala Gly Gin Gly Val Arg Asp 
195 200 205 

Thr Thr Arg lie Ala Gly Gly Arg Ala Ala Leu Trp Thr Glu lie Leu 
210 215 220 

Ala Ala Asn Ala Gly Ala Val Ala Asp Val Leu Asp Asp Leu Ser Ala 
225 230 235 240 

Glu Leu Ala Ala Thr lie Ser Ala Leu Arg Glu Leu Glu Ala His Pro 
245 250 255 

Gly Arg Ala Glu Ala Leu Ala Ala Leu Thr Gly Met Leu Gin Arg Gly 
260 265 270 

Val Asp Gly Arg Asp Arg lie Ala Ala Ser Pro 
275 280 

<210> 6 
<211> 336 
<212> PRT 

<213> Actinoplanes sp. 
<400> 6 

Met Glu Ser Leu His lie Ala Ser Ala Arg His Glu Pro Asp Arg His 
15 10 15 

Asp Glu Thr Gin Met Asn Thr Pro Ser Met Met Arg Val Glu Trp Leu 
20 25 30 

Pro Val Asp Ser Leu Glu Met Leu Asp Ser Pro Arg Leu Ala Gly Glu 
35 40 45 

Asp Pro Arg His Thr Gin Met Leu Ala Ser Leu Asp Ala Glu Leu Pro 
50 55 60 

Pro lie lie Val His Arg Ala Ser Met Arg Val He Asp Gly Ala His 
65 70 75 80 

Arg Leu Gly Ala Ala Arg Leu Arg Gly Asp Glu Leu He Lys Ala Ala 
85 90 95 

Met Phe Glu Gly Ser Glu Gin Glu Ala Phe Val Leu Gly Val Lys Ala 
100 105 110 



Asn He Ala His Gly Leu Pro Leu Ser Thr Ala Asp Arg Thr Arg Ala 
115 120 125 



Ala Glu Arg lie lie Glu Ser His Pro Ser Trp Ser Asp Arg Thr lie 
130 135 140 



Ala Ala Ser Ser Gly Leu Ser Ala Arg Thr Val Gly Asn lie Arg Arg 
145 150 155 160 

Arg Leu Glu Leu Ser Gly Asp He Gly Gin Gly Ser Arg Thr Arg Val 
165 170 175 

Gly Arg Asp Gly Arg Val Arg Pro Leu Asp Asn Ser Glu Gly Arg Leu 
180 185 190 

Lys Ala Val Ser Tyr He Gin Gin Gin Pro Asp Ala Ser Leu Arg Glu 
195 200 205 

He Ala Lys Asn Ala Gly Val Ser Pro Ser Thr Ala Arg Asp Val Arg 
210 215 220 

Asn Arg Leu Gin Arg Gly Glu Asp Pro Leu Pro Gly Pro Arg Arg Thr 
225 230 235 240 

Gly Gly His Arg Asp Asp He Ser Phe Asp Lys Glu Asn Thr He Arg 
245 250 255 

Leu Leu Glu Pro Thr Val Arg Ser He Leu Gin Gly Leu Lys Asn Asp 
260 265 270 

Pro Ser Leu Arg Phe Thr Glu Ser Gly Arg Asn Leu Leu Arg Trp Val 
275 280 285 

Leu Ala Arg Thr Val Gin Asp Asp Glu Trp Lys Asp Met Leu Asp Ala 
290 295 300 

Val Pro Ser His Cys Thr Tyr Val Leu Ala Asn Val Ala Arg Arg Cys 
305 310 315 320 

Ser Gin Glu Trp Leu Glu Phe Ala Glu Thr Leu Glu Lys Asn Ala Ala 
325 330 335 

<210> 7 
<211> 444 
<212> PRT 

<213> Actinoplanes sp . 
<400> 7 

Met Ser He Leu Arg Glu Ala Pro Gly Thr Gly Arg Val Leu Arg Arg 
15 10 15 

Glu Asp Leu His Gin Ser Leu Ser Asp Pro Leu Leu Asp Thr Met Asn 
20 25 30 

Phe Leu Asn Glu Val Thr Ala Arg Tyr Pro Arg Ala Val Ser Phe Ala 
35 40 45 

Pro Gly Arg Pro Phe Asp Gly Phe Phe Asp Val Glu Gin He Phe Arg 
50 55 60 



Gly He Arg Gly Tyr Leu Glu His Leu Ala Gly Gin Gly Arg Ser Pro 
65 70 75 80 



Ala Glu lie Arg 



lie Arg Glu Val 
100 

Val Ala Pro Glu 
115 

Leu Leu Ala Leu 
130 

Val Ala Ser Pro 
145 

Asp lie Asp Pro 



Glu Asp Leu Ala 
180 

Pro Arg Ala Phe 
195 

Thr Met Pro Leu 
210 

Leu Gly Leu Leu 
225 

Gly Gin Gin Leu 



Val His Leu Gly 
260 

Gly Phe Ala He 
275 

Gly Leu Leu Ala 
290 

Asn Thr Ser Pro 
305 

Ala Gly Gly Thr 



Gly Ala Ala Met 
340 

Ala Ser Phe Arg 
355 

Gly Phe Phe Leu 
370 

Leu Ser Arg Ser 
385 

Tyr Phe Tyr Pro 



Asp Ala Val Phe 
85 

He Ala Gin Trp 



Ser He Val Val 
120 

Arg Ala Leu He 

^ 135 

Cys Tyr Val Gly 
150 

Val Pro Val Ala 
165 

Arg Ala Val His 



Tyr Val Val Pro 
200 

Glu Ala Arg His 
215 

Val He Glu Asp 
230 

Pro Ser Leu Lys 
245 

Ser Phe Ser Lys 



Ala Asp Gin Pro 
280 

Asp Glu Leu Ala 
295 

Leu Ser Gin Ala 
310 

Ala Ala Glu Ala 
325 

Arg Arg Thr Leu 



Ala Arg Thr Gly 
360 

Ala Val Asn Val 
375 

Ala Glu Asp His 
390 

Ala Gly Gly Gly 



Gin Tyr Gly Pro 
90 

Leu Arg Arg Asp 
105 

Thr Val Gly Ala 



Arg Asp Glu Arg 
140 

He Thr Gly Ala 
155 

Glu Arg Glu Asp 
170 

Ala Glu Leu Ser 
185' 

Asp His Thr Asn 



Ala Leu Leu Asp 
220 

Ser Pro Tyr Arg 
235 

Ala Leu Asp Pro 
250 

Thr Leu Phe Pro 
265 

Val Ser Asp Ala 



Lys Val Lys Ser 
300 

Ala Val Ala Gly 
315 

Ser Ala Glu Ser 
330 

Asp Arg Leu Glu 
345 

Val Arg Trp Asn 



Pro Phe Thr Ala 
380 

Gly Val He Trp 
395 

Glu Gin Gly He 



Ala Ala Gly Arg 
95 

Glu Gly He Asp 
110 

Gin Glu Ala Met 
125 

Asp Ala Leu Phe 



Ala Arg Leu Leu 
160 

Gly Phe His Pro 
175 

Arg Gly Arg Arg 
190 

Pro Ser Gly Ala 
205 

Leu Ala Gly Glu 



Leu Val Ser Pro 
240 

Gly Arg His Val 
255 

Gly Ala Arg Val 
270 

Ala Gly Gly Ala 
285 

Met Val Thr Val 



Met Leu Leu Ala 
320 

Ser Ala His Tyr 
335 

Glu His Leu Pro 
350 

Arg Pro Ser Gly 
365 

Asp Asn Ala Ala 



Thr Pro Met Ser 
400 

Arg Leu Ser He 



405 



410 



415 



Ser Tyr Leu Thr Pro Glu Glu lie Asp Glu Gly Val Lys Arg Leu Ala 
420 425 430 

Gly Phe lie Thr Thr Glu He Ala Ala Leu Arg Pro 
435 440 

<210> 8 
<211> 356 
<212> PRT 

<213> Actinoplanes sp, 
<220> 

<221> rnisc_f eature 
<222> (1)..(1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 8 

Val Thr Ala Thr Ala Leu Leu Pro Leu Thr Leu Ala Asp Tyr Glu Gin 
15 10 15 

Leu Ala Gin Ala Arg Met Glu Pro Pro Val Trp Asp Phe He Ala Gly 
20 25 30 

Gly Ala Gly Glu Glu Leu Thr Leu Ala Ala Asn Thr Ala Ala Phe Ala 
35 40 45 

Pro Pro Arg Leu Arg Pro Arg Val Leu Thr Gly Ala Gly Ala Pro Asp 
50 55 60 

Thr Gly Thr Thr He Leu Gly Arg Arg Trp Ala Ala Pro He Gly Val 
65 70 75 80 

Ala Pro Leu Gly Tyr His Thr Leu Val Asp Pro Ala Gly Glu Val Ala 
85 90 95 

Thr Ala Ala Ala Ala Gly Ala Ala Gly Leu Pro Leu Val Val Ser Thr 
100 105 110 

Phe Ser Gly Arg Thr Val Glu Asp He Ala Ala Ala Thr Thr Ala Pro 
115 120 125 

Arg Trp Leu Gin Val Tyr Cys Phe Arg Asp Arg Ala Val Thr Ala Ala 
130 135 140 

Leu Val Thr Arg Ala Val Arg Ala Gly Phe Glu Ala Leu Val Leu Thr 
145 150 155 160 

Val Asp Ala Pro Arg Leu Gly Arg Arg Leu Arg Asp He Arg Asn Asp 
165 170 175 

Phe Arg Leu Pro Pro Gly Val Ala Pro Ala Asn Leu Thr Gly Asp Gly 
180 185 190 



Phe Ala Ser Pro Ser Gly His Ala Leu Gly Ala Phe Asp Ala Ala Met 
195 200 205 



Asp Trp Thr Val Val Ala Trp Leu Arg Glu Leu Ser Gly Leu Pro Val 
210 215 220 



Leu Leu Lys Gly Val Leu Thr Ala Asp Gly Ala Arg Arg Ala Leu Asp 
225 230 235 240 

Ala Gly Ala Asp Gly lie Val Val Ser Asn His Gly Gly Arg Gin Leu 
245 250 255 

Asp Gly Val Pro Ala Thr Leu Asp Val Leu Pro Glu Val Val Ala Ala 
260 265 270 

Val Ala Gly Arg Cys Pro Val Leu Leu Asp Gly Gly Val Arg Arg Gly 
275 280 285 

Arg Asp Val Leu Leu Ser Leu Ala Leu Gly Ala Asp Ala Val Leu Val 
290 295 300 

Gly Arg Pro Val Leu Tyr Gly Leu Ala Val Gly Gly Thr Ala Gly Val 
305 310 315 320 

Arg His Val Leu Asp lie Leu Ala Gly Glu Leu Thr Asp Asp Met Ala 

r^^i 325 330 335 



Leu Ala Gly Val Ala Ser Pro Ala Asp Ala Gly Ala Asp Leu Ala Gly 
340 345 350 



ffl Pro Val Ala Pro 

P^j 355 



<210> 9 
<211> 640 
<212> PRT 

<213> Actinoplanes sp . 
<220> 

<221> misc_f eature 
<222> (1) . . (1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized prptein will have a f ormylmethionine at this 
position 



<400> 9 

Val Ala Thr He Asp Gly Pro Asp Leu Gly Val He Gly Leu Arg Val 
15 10 15 

Asp Gly Leu He Pro Met Gin Lys Val Arg Pro Gly Thr Val Arg Arg 
20 25 30 

He Leu Pro Tyr Ala Lys Lys His Arg Trp Ser Leu Ala Val Ala Leu 
35 40 45 

Leu Met Thr Val Val Asp Ala Ala Leu Thr Val Ala Asn Pro Leu Leu 
50 55 60 

Leu Lys Gin He He Asp Arg Gly He Val Ala Gly Arg Leu Asp Val 
65 70 75 80 



Val Val Gly Leu Ser Leu Val Val Ala Gly Leu Ala Leu Val Asn Val 
85 90 95 



Ala Ala lie His Val Gin Thr Leu Ala Ser Gly Arg Val Gly Gin Gly 
100 105 110 

Leu lie Tyr Asp Leu Arg Thr Lys Val Phe Ala His Val Met Arg Gin 
115 120 125 

Pro Leu Ala Phe Phe Thr Arg Ala Gin Thr Gly Ser Leu Val Ser Arg 
130 135 140 

Leu Asn Thr Asp Val Val Gly Ala Glu Gin Ala Met Thr Ser Met lie 
145 150 155 160 

Thr Gin Thr Val Ser Thr Val Leu Thr Val Val Leu Val lie Gly Ala 
165 170 175 

Met Phe Tyr Leu Ser Trp Ala lie Ala Leu Val Ala Leu Val Leu lie 
180 185 190 

Pro Leu Phe Phe Leu Pro Gly Lys Leu lie Ala Gly Arg Leu Glu Arg 
195 200 205 

Leu Ala Arg Gly Gly Met Gin Val Asp Ala Glu Leu Gly Ser Met Met 
210 215 220 

Asn Glu Arg Phe Asn Val Ser Gly Ala Met Leu Val Lys Leu Tyr Gly 
225 230 235 ^ 240 

Arg Pro Glu Ser Glu Glu Thr Ala Phe Ala Gly Arg Ala Ala Arg Val 
245 250 255 

Arg Asp lie Ala He Ser Met Gly Val His Ala Arg Leu Leu Phe lie 
260 265 270 

He Ala Thr Leu Leu Thr Thr Val Thr Thr Ala Met Val Tyr Gly Phe 
275 280 285 

Gly Gly Ala Leu Val He Asp Gly Thr Leu Gly He Gly Thr Leu Val 
290 295 300 

Ala Met Val Ala Leu Leu Ala Gin Leu Tyr Gly Pro Val Asn Gin Leu 
305 310 315 320 

Thr Asn He Gin Val Asp Val Val Thr Ala Leu Val Ser Phe Asp Arg 
325 330 335 

Val Phe Glu Val Leu Asp Leu Asp Pro Leu Val Lys Glu Arg Pro Gly 
340 345 350 

Ala Arg Ala Leu Pro Ala Ala Glu Pro Gly Arg Ser Ala Ala Pro Asp 
355 360 365 

He Glu Phe Asp Asn Val Val Phe Arg Tyr Pro Gly Ala Asp Glu Val 
370 ^ 375 380 

Ser Leu Ala Ser Leu Glu Thr Val Ala Gin Arg Ser Ser Asp Gly Thr 
385 390 395 400 



Ala Glu Arg Pro Val Leu Asn Gly He Ser Phe Leu Ala Pro Ala Gly 



405 



410 



415 



Lys Leu Thr Ala Leu Val Gly Pro Ser Gly Ala Gly Lys Thr Thr lie 
420 425 430 

Thr His Leu Val Pro Arg Leu Tyr Asp Thr Thr Ser Gly Thr Val Arg 
435 440 445 

lie Ala Gly His Asp Val Arg Asp Leu Thr Leu Arg Ser Leu Ser Glu 
450 455 460 

Ser lie Gly Val Val Thr Gin Asp Ala His Leu Phe His Asp Thr He 
465 470 475 480 

Arg Ala Asn Leu Leu Tyr Gly Arg Pro Asp Ala Gly Glu Arg Asp Leu 
485 490 495 

Val Ala Ala Cys Glu Ala Ala Arg He Trp Glu Met Val Ser Ser Leu 
500 505 510 

Pro Asp Gly Leu Asp Thr Val Val Gly Asp Arg Gly Tyr Arg Leu Ser 
515 520 525 

Gly Gly Glu Lys Gin Arg Leu Ala Leu Ala Arg Leu Leu Leu Lys Ser 
530 535 540 

Pro Pro Val Val Val Leu Asp Glu Ala Thr Ala His Leu Asp Ser Glu 
545 550 555 560 

Ser Glu Ala Ala He Gin Arg Ala Leu Asp Thr Ala Leu Ala Gly Arg 
565 570 575 

Thr Ser Leu Val He Ala His Arg Leu Ala Thr He Leu Asp Ala Asp 
580 585 590 

Gin He Leu Val He Asp Asp Gly Arg Val Val Glu Arg Gly Thr His 
595 600 605 

Asp Glu Leu He Ala His Gly Gly Leu Tyr Ala Glu Leu Tyr Arg Thr 
610 615 620 

Gin Phe Ala Gly Gin Arg Thr Glu Glu Arg Gin Pro Ala Val Pro Ser 
625 630 635 640 

<210> 10 
<211> 271 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc„f eature 
<222> (1)..(1) 

<223> V represents a non-standard initiator codon. It- is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 10 



Val Ser Ala Ala Gly Ser Gly Phe Val Thr Thr Asn Gly Val Arg Leu 
15 10 15 



Ala Tyr Arg Arg Ser Gly Ala Gly Glu Pro Val Leu Met lie Met Gly 
20 25 30 



Ser Gly Ser Ala Gly Gin Thr Trp Thr Val His Gin Thr Pro Ala Leu 
35 40 45 

His Glu Ala Gly Tyr Ser Thr Val Val Phe Asp Ser Arg Gly lie Pro 
50 55 60 

Pro Ser Asp Val Pro Ala Gly Lys Tyr Ser Leu Ala Asp Met Thr Ala 
65 70 75 80 

Asp Thr Arg Gly Leu lie Glu Ala Leu Asp Leu Ala Pro Cys Arg lie 
85 90 95 

Val Gly Thr Ser Leu Gly Ala Met lie Ala Gin Glu Leu Ala Val Asp 
100 105 110 

His Pro Glu Leu Val Arg Cys Ala Val Leu lie Ala Thr Leu Ala Arg 
115 120 125 

Pro Asp Ala Ala Arg Ala Ala Gin Asn Gin Ala Asp lie Asp Leu Leu 
130 135 140 

Glu Ser Gly Val Thr Leu Pro Ala Ala Tyr Glu Ala Ala Thr Ala Val 
145 150 155 160 

Phe Lys Met Phe Ser Pro Ala Thr Leu Asn Asp Asp Val Ala Val Arg 
165 170 175 

Glu Trp Leu Asp lie Phe Glu Leu Ser Gly Thr Gly Val Ser Ala Gly 
180 185 190 

Gly Gin Ala Trp Ala Glu Leu Thr Gly Asp Arg Arg Ala Ala Leu Arg 
195 200 205 

Ser Val Thr Ala Pro Cys Arg Val lie Ser Phe Ala Asp Asp Leu lie 
210 215 220 

Thr Pro Pro His Leu Ala Ala Glu Val Ala Glu Ala lie Pro Asp Cys 
225 230 ^ 235 240 

Asp Leu Val Glu lie Ser Arg Cys Gly His Leu Gly Tyr Leu Glu Arg 
245 250 255 

Pro Asp Ala Val Asn Ala Ala lie Leu Glu Phe Leu Asp Ser His 
250 265 270 

<210> 11 
<211> 529 
<212> PRT 

<213> Actinoplanes sp . 
<400> 11 

Met Gly Asn Ala Asp Gin Pro Arg Tyr Leu Arg Ser Asn Val lie Ala 
15 10 15 



Glu Pro Leu Val Asp Arg Phe Tyr Ala Trp Leu His Thr Val Ala Pro 
20 25 30 



Val Pro Ala Ser Met Asn Leu Ala Phe Leu Gin Val Pro Leu Leu Glu 
35 40 45 



Ser Tyr Leu Gin Ser Pro Pro Val His Val Ala Ala Ser Thr Asn Pro 
50 55 60 

Lys Met Arg Gly Gly Tyr Phe Val Ala Val Glu Glu Ser Arg Ser Asp 
65 70 75 80 

Glu Val Ala Glu Leu Leu Lys Thr lie Lys Asn Glu Arg Ala Asp Met 
85 90 95 

Leu Gly Phe Ala Ala Ala Val Ala Glu Ala Glu Asp Leu lie Arg Glu 
100 105 110 

Asn Ala Val Gly Tyr Asp Leu Thr Pro Leu Tyr Pro Arg Leu Pro Ala 
115 120 125 

Ala Leu Asn Gly Leu Val Glu lie Ala Tyr Asp Thr Ser Asn Gin Pro 
130 135 140 

Ser Leu His Phe Leu Glu Pro Leu Leu Tyr Arg Ser Pro Ala Tyr Asp 
145 150 155 160 

Glu Arg Arg Gin Ser Val Gin Leu Ser Leu Asp Asp Gly Val Glu Arg 
165 170 175 

Pro Phe lie Leu Ser Thr Pro Arg Leu Pro Arg Ala Gly Val Leu Asp 
180 185 190 

Leu Pro Leu Pro Leu Arg His Pro Gly Leu Thr Glu Leu Phe Asp Ala 
195 200 205 

Arg Val Arg Pro Thr Ser Leu Asn Arg Leu Arg Glu Ala Leu Glu Leu 
210 215 220 

Asp Asp Ala Gly Ala Ala Ala Leu Asp Ala Leu Leu Thr Asp Glu Pro 
225 230 235 240 

Ser Leu Ser Pro Asp Arg His lie Glu Ser Gly Gly Arg Val Arg Tyr 
245 250 255 

Tyr Gly His Ala Cys Val Val Met Gin Thr Glu Gin Ala Ala Val Val 
260 265 270 

Thr Asp Pro Phe lie Ser Thr Asp Asn Arg His Gly Asp Arg Tyr Thr 
275 280 285 

Leu Asp Asp Leu Pro Asp His lie Asp Leu Val Leu lie Thr His Gly 
290 295 300 

His Gin Asp His lie Val Leu Glu Thr Leu Leu Gin Leu Arg Gly Arg 
305 310 315 320 

lie Gly Thr Val Val Val Pro Arg Thr Ser Arg Gly Asn Leu Pro Asp 
325 330 335 

Pro Ser He Ala Leu Tyr Leu Arg Arg He Gly Phe Thr Val Val Glu 
340 345 350 



65 



Val Glu Glu Phe Asp Glu Val Pro Phe Pro Gly Gly Thr Val Thr Ala 
355 360 365 

Thr Pro Phe Leu Gly Glu His Ala Asp Leu Asp lie Arg Gly Lys Ser 
370 375 380 

Thr Tyr Phe Val Arg Met Ala Gly Arg Thr lie Phe He Gly Ala Asp 
385 390 395 400 

Ser Ser Gly He Asp Pro Val Leu Tyr Arg Tyr He Arg Asp His Val 
405 410 415 

Gly Gin Val Asp Met Ala Phe Leu Gly Met Glu Cys Asp Gly Ala Pro 
420 425 430 

Leu Asn Trp Leu Tyr Lys Gly Leu Leu Thr Lys Pro Val Asn Lys Lys 
435 440 445 

Met Ser Ala Ser Arg Arg Leu Ser Gly Ser Asn Ala Glu Gin Ala Gly 
450 455 460 

Ala He Met Thr Glu Leu Gly Ala Thr Ala Gly Tyr He Tyr Ala Met 
465 470 475 480 

Gly Glu Glu Ser Trp Gin Gly His Val Met Ala Thr Thr Tyr Asn Glu 
485 490 495 

Asp Thr Tyr Gin Leu Lys Gin He Asp Glu Phe Leu Ala Trp Cys Ala 
500 505 510 

Asp Arg Gly Phe Thr Ala Glu His Leu Phe Asn Lys Arg Glu Trp Arg 
515 520 525 

Trp 



<210> 12 
<211> 90 
<212> PRT 

<213> Actinoplanes sp. 
<400> 12 

Met Ser Glu Thr Asp Leu Ser Ala Ala Arg His Thr Pro Glu Gin He 
15 10 15 

Arg Ser Trp Leu He Asp Arg He Ala Tyr Tyr Val Met Leu Pro Thr 
20 25 30 

Gin Glu He Glu Pro Asp Val Ser Leu Ala Glu Tyr Gly Leu Asp Ser 
35 40 45 

Val Tyr Ala Phe Ala Leu Cys Gly Glu He Glu Asp Thr Leu Gly He 
50 55 60 

Pro He Glu Pro Thr Leu Leu Trp Asp Val Asp Thr Val Ala Thr Leu 
65 70 75 80 



Thr Ala His Leu Ala Asp Arg Val Asn Arg 
85 90 



<210> 13 
<211> 1051 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc„f eature 
<222> (1) , . {1) 

<223> V represents a non-standard codon. It is expected that the biosy 
nthesized protein will have a f ormylmethionine residue at this po 
sition 



<400> 13 

Val Pro Thr Pro Asp Leu Arg Pro Leu Thr Pro Ala Gin Leu Ala Val 
15 10 15 

Trp His Ala Gin Gin Leu Ala Pro His Ser Pro Val Tyr Gin Val Gly 
20 25 30 

Glu Phe Val Glu lie Asp Gly Glu Cys Asp Pro Asp Leu Leu Val Ala 
35 40 45 

Ala Leu Arg Gin Val Met Gly Glu Ala Glu Ser Ala Arg Leu Arg Phe 
50 55 60 

Arg Val lie Asp Gly Thr Pro Trp Gin Tyr Val Ala Glu Asp Gly Asp 
65 70 75 80 

Asp Pro lie Gin Val Val Asp Leu Gly Ala Ala Ala Asp Pro Arg Ala 
85 90 95 

Ala Ala Leu Gly Arg Met Ala Ala Asp Leu Asp Arg Pro Gly Asp Leu 
100 105 110 

Arg Asp Gly Pro Leu Val Glu His His Val Tyr Leu Leu Gly Glu Gly 
115 120 125 

Arg Val lie Trp Tyr His Arg Ala His His lie Val Cys Asp Gly Gly 
130 135 140 

Ser Leu Gly lie Val Ala Ser Arg Val Ala Gly Val Tyr Ser Ala Leu 
145 150 155 160 

Ala Ala Gly Gly Asp Val Arg Pro Gly Ala Leu Pro Pro Leu Ser Val 
165 170 175 

Leu Leu Ser Ala Ala Asp Ala Tyr Glu Arg Ser Gly Asp Arg Asp Arg 
180 185 190 

Asp Arg Glu His Trp Arg Ser Ala Leu Ala Gly Leu Pro Ala Glu Leu 
195 200 205 

Leu Ala Gly Ala Gly Arg Pro Arg Pro Leu Pro Gly Pro Pro Val Arg 
210 215 220 

His Glu His Asp Leu Ser Ala Ala Glu Ala Gly Arg Leu Arg Ala Gly 
225 230 235 240 



Ala Arg Arg Leu Arg Thr Ser Val Ala Gin Ala Gly lie Ala Ala Ala 



245 250 255 

Ala Leu Tyr Gin His Arg Leu Thr Gly Ala Arg Asp Val Leu Val Ala 
260 265 270 

Val Pro Val Ala Gly Arg Thr Thr Arg Pro Glu Phe Asp Val Pro Gly 
275 280 285 

Met Thr Ser Asn Val Val Pro Val Arg Leu Ala Val Thr Pro Ala Thr 
290 295 300 

Thr Val Gly Glu Leu Leu Arg Asp Val Ala Arg Gly Val Arg Asp Gly 
305 310 ■ 315 320 

Leu Arg His Gin Arg Tyr Pro Tyr Pro Asn lie Val Asp Asp Leu Gly 
325 330 335 

Leu Ala Asp Arg Ala Ala Leu Arg Pro Val Thr Val Asn Ala Leu Ala 
340 345 350 

Leu Gly Arg Pro Leu Arg Phe Gly Ser Ala Val Gly Val Arg Ser Gly 
355 360 365 

Leu Ser Ala Gly Pro Val Asp Asp Val Thr He Gly Leu Tyr Glu Lys 
370 375 380 

Val Ser Gly Gly Gly Met Gin Thr He Ala Glu Leu Asn Pro Gly Arg 
385 390 395 400 

Thr Asp Arg Pro Asp Ala Ala Glu Val Ser Arg Trp Phe Arg Thr Leu 
405 410 415 

Leu Arg Gly Leu Ala Glu Ser Asp Ala Gly Asp Pro Val Ala Arg He 
420 425 430 

Asp He Val Asp Glu Pro Glu Arg Arg Arg Leu Leu Asp Glu Trp Asn 
435 440 445 

Ala Thr Ala Ala Pro Ser Ser Asp Thr Val Leu Ala Arg Phe Glu Glu 
450 455 460 

Gin Ala Ala Arg Thr Pro Glu Ala Pro Ala Val Val Cys Gly Asp Val 
465 470 475 480 

Thr Val Thr Tyr Ala Glu Leu Glu Ala Gly Ala Asn Arg Leu Ala Arg 
485 490 495 

Val Leu Arg Ala Arg Gly Ala Gly Pro Glu Ser Val Val Ala Leu Cys 
500 505 510 

Leu Pro Arg Gly Pro Glu Val Val Thr Gly He Leu Ala Ala Trp Lys 
515 520 525 

Ala Gly Ala Ala Tyr Leu Pro Val Asp Thr Glu Leu Pro Ala Glu Arg 
530 535 540 

Val Ala Tyr Leu Leu Gly Asp Ser Ala Ala Ala Val Arg Leu Gly Thr 
545 550 555 560 

Ala Glu Thr Leu Ala Ala Leu Pro Asp Gly Pro Ala Ala Asp Val Asp 
565 570 575 



68 



Val His Ala Pro 
580 

Glu Pro Leu Pro 
595 

Thr Gly Leu Ser 
610 

Tyr Val Gly Trp 
625 

His Ser Ser Leu 



Leu Val Cys Gly 
660 

Gly Leu Ala Ser 
675 

Val Val Pro Gly 
690 

Glu Met Ala Ala 
705 

Gly Gly Asp Val 



Val Asn Glu Tyr 
740 

Ser Val Ala Ala 
755 

Pro Val Ala Asn 
770 

Val Pro Ala Gly 
785 

Ala Arg Gly Tyr 



Ala Cys Pro Phe 
820 

Val Arg Leu Ala 
835 

Gin Val Lys He 
850 

Val Val Ala Gly 
865 

Pro Asp Ala Val 



Glu lie Ala Arg 



Asp Gin Leu Ala 
500 

Lys Gly Val Gly 
615 

Ala Ser Val Leu 
630 

Ala Phe Asp Leu 
645 

Gly Ser Val Val 



Leu Leu Ala Ala 
680 

His Leu Arg Leu 
695 

Val Gly Ser Leu 
710 

Arg Glu Trp Leu 
725 

Gly Pro Thr Glu 



Gly Asp Val Val 
760 

Thr Arg Leu Phe 
775 

Val Ala Gly Glu 
790 

Val Gly Arg Ser 
805 

Gly Val Gly Glu 



Gly Gly Asp Leu 
840 

Arg Gly Tyr Arg 
855 

His Pro Arg Val 
870 

Gly Glu Arg Gin 
885 



Glu Ser Pro Ser 
585 

Tyr Val He Tyr 



Val Ser His Gly 
620 

Tyr Gly Gly Leu 
635 

Thr Val Thr Ser 
650 

Val Ser Ala Ala 
665 

Gly Asp Gly Phe 



Leu Ala Glu Leu 
700 

Val Ala Gly Gly 
715 

Ser Arg Val Pro 
730 

Thr Val Val Gly 
745 

Gly Asp Val Val 



Val Leu Asp Glu 
780 

Leu Tyr Val Ala 
795 

Gly Leu Thr Ala 
810 

Arg Met Tyr Arg 
825 

Val Phe Val Gly 



Val Glu Pro Asp 
860 

Ala Gly Ala Ala 
875 

Leu Val Ala Tyr 

890 



Pro Leu Arg Leu 
590 

Thr Ser Gly Ser 
605 

Gly Leu Ala Asn 



Ser Ala Pro Leu 
640 

Val Phe Val Pro 
655 

Gly Gly Gly Arg 
670 

Ser Leu Val Lys 
685 

Val Pro Ala Gly 



Glu Val Leu Ala 
720 

Gly Ser Val Val 
735 

Cys Ser Val Phe 
750 

Pro Val Gly Arg 
765 

Gly Leu Arg Pro 



Gly Ser Gin Val 
800 

Ser Arg Phe Val 
815 

Thr Gly Asp Val 
830 

Arg Val Asp Glu 
845 

Glu Val Arg Leu 



Val Val Ala Arg 
880 

Val Val Ala Ala 
895 



Gly Glu Pro Ala Gly Leu Ala Glu Ser Val Arg Ala His Val Ala Glu 
900 905 910 

Arg Leu Pro Glu Tyr Met Val Pro Ala Ala Val Val Thr Leu Asp Glu 
915 920 925 

lie Pro Leu Thr Val Asn Gly Lys Val Asp Arg Ala Ala Leu Pro Glu 
930 935 940 

Pro Gly Pro Val Ala Thr Gly Asn Ala Asp Arg Glu Pro Thr Thr Glu 
945 950 955 950 

Arg Glu Ser Leu Leu Cys Gly Ala Phe Ala Asp Val Leu Gly lie Glu 
965 970 975 

Arg Val Gly Val Asp Asp Asp Phe Phe Ser Leu Gly Gly His Ser Leu 
980 985 990 

Leu Ala Thr Ser Leu Val Ser Arg Val Arg Leu Val Leu Gly Glu Glu 
995 1000 1005 

Leu Pro lie Glu Glu Leu Phe Ala Thr Pro Thr Pro Ala Glu Leu 
1010 1015 1020 

Ala Ala Trp Leu Gin Arg Asn Ala Asp Arg Pro Gin Pro Ala Arg 
1025 1030 1035 

Pro Ala Leu Arg Pro Met His Glu Arg Glu Thr Thr Ala 
1040 1045 1050 

<210> 14 
<211> 4999 
<212> PRT 

<213> Actinoplanes sp . 
<400> 14 

Met Thr Pro Met Ser Tyr Ala Gin Arg Arg Leu Trp Phe Gin Leu Arg 
15 10 15 

Val Glu Gly Pro Asp Ala Thr Tyr Asn Ser Pro Ala Val Leu Arg. Leu 
20 25 30 

Thr Gly Glu Leu Asp Thr Ala Ala Leu Glu His Ala Leu Arg Asp Val 
35 40 45 

Leu Glu Arg His Glu Val Leu Arg Thr Val Tyr Pro Asp Val Gly Gly 
50 55 60 

Glu Pro Arg Gin Arg Val Val Arg Pro Asp Asp Met Val Trp Glu Leu 
65 70 75 80 

Pro Thr Thr Arg Val Ser Gly Ala Gly Ala Gly Asp Asp Arg Leu Val 
85 90 95 

Thr Leu Asp Glu Leu Pro Trp Asp Arg Pro Val Leu Asp Leu Pro Ser 
100 105 110 



Pro Ala Pro Ala Gly Arg Glu Pro Asp Gly Glu lie Thr Val Asp Glu 
115 120 125 



Leu Pro Gly Ala He Ala Arg Val Ala Ala His Pro Phe Asp Leu Ser 
130 135 140 

He Glu He Pro Val Arg Ala Arg Leu Phe Ala Leu Gly Pro Arg His 
145 150 155 160 

His val Leu Val Val Val Leu His His He Ala Thr Asp Gly Ser Ser 
165 170 175 

Gly Gly Pro Phe Ala Arg Asp Leu Ala Ala Ala Tyr Arg Ala Arg Arg 
180 185 190 

Thr Gly Thr Ala Pro Gin Trp Ala Pro Leu Pro Val Gin Tyr Ala Asp 
195 200 205 

Tyr Ala Ala Trp Gin Gin Glu Leu Leu Gly Ala Glu Asp Asp Pro Asp 
210 215 220 

Ser Val He Ser Arg Gin Leu Ala His Trp Gin Glu Arg Leu Ala Gly 
225 230 235 240 

Met Pro Val Glu Leu Asp Leu Pro Ala Asp Arg Pro Arg Pro Ala Glu 
245 250 255 

Pro Gly His Gly Gly His Thr Lys Ala Leu Ser Leu Pro Pro Ala Val 
260 265 270 

His Arg Gly Leu Ala Thr Leu Ala Arg Arg Arg Arg Ala Thr Leu Gin 
275 280 285 

Met Val Val Gin Thr Gly Val Ala He Leu Leu Ser Lys Leu Gly Ala 
290 295 300 

Gly Arg Asp Val Pro Leu Gly He Pro Val Ala Gly Arg Thr Asp Ala 
305 310 315 320 

Ala Leu Asp Asp Leu He Gly Phe Phe Val Asn Thr Leu Val Val Arg 
325 330 335 

Ala Asp Leu Ser Gly Asp Pro Thr Val Ala Asp Ala Leu Gly Arg Val 
340 345 350 

Arg Gly Gly Ala Val Ala Ala Leu Ala Asp Gin Asp Val Pro Phe Asp 
355 360 365 

Lys Leu Val Glu Arg Leu Ala Pro Ala Arg Val Leu Gly Arg His Pro 
370 375 380 

Leu Phe Gin Val Met Val Ala Pro Leu Asp Asp Gly Thr Pro He Asp 
385 390 395 400 

Leu Asp Gly Val Arg Gly Glu Pro Leu Thr He Gly Arg Ser Gly Ala 
405 410 415 

Lys Phe Asp Val Glu Val Met Thr Gly Glu Val Arg Ala Ala Asp Gly 
420 425 430 

Ala Pro Ala Gly He Arg Gly He Leu Thr Leu Ser Ala Asp Leu Phe 
435 440 445 

Asp Glu Ala Thr Ala Gly Arg Met Ala Ala Gly Leu Val Arg Val Leu 



450 



455 460 



Thr Ala Met Ala Glu Ala Pro Glu Arg Arg Leu Ser Gly He Glu Val 
465 470 475 480 

Leu Ser Pro Gly Glu Arg Ser Arg Leu Leu Val Glu Trp Asn Asp Thr 
485 490 495 

Ala Arg Pro Val Val Glu Ser Ser Val Pro Ala Leu Phe Ala Lys Arg 
500 505 510 

Val Ala Ala Thr Pro Asp Ala Thr Ala Val Val Gly Glu Gly Val Ser 
515 520 525 

Trp Ser Tyr Arg Glu Leu Asp Arg Arg Ser Asp Val Leu Ala Arg Arg 
530 535 540 

Leu Val Ala Ala Gly Val Gly Val Glu Ser Pro Val Val Val Ala Leu 
545 550 555 560 

Glu Arg Ser Pro Glu Val Leu Ser Ala Phe Leu Ala Val Ala Lys Ala 
565 570 575 

Gly Gly Val Phe Val Pro Val Asp Leu Ser Trp Pro Gin Ala Arg Val 
580 585 590 

Asp Ala Val Val Ala Asp Cys Ala Ala Arg Val Ala Val Ala Asp Arg 
595 600 605 

Pro Met Ser Gly Leu Thr Val Val Ser Ala Gly Leu Gly Gly Asp Ser 
610 615 620 

Ala Val Val Ser Ala Asp Leu Thr Ala Asp Arg Ala Val Val Leu Pro 
625 630 635 640 

Ser Arg Pro Val Pro Gly Ala Ala Val Tyr Arg Met Tyr Thr Ser Gly 
645 650 655 

Ser Thr Gly Arg Pro Lys Gly Val Val Thr Thr His Gin Asn Leu Val 
660 665 670 

Asp Leu Ala Thr Asp Thr Cys Trp Gly Pro Thr Pro Arg Val Leu Phe 
675 680 685 

His Ala Pro His Ala Phe Asp Ala Ser Ser Tyr Glu He Trp Val Pro 
690 695 700 

Leu Leu Asn Gly Gly Thr Val Val Val Ala Pro Gin Arg Ser He Asp 
705 710 715 720 

Ala Thr Val Leu Lys Asp Leu He Arg Ala His Asp Leu Thr His Val 
725 730 735 

His Val Thr Ala Gly Leu Leu Arg Val Leu Asp Pro Ser Cys Phe Ala 
740 745 750 

Gly Leu Thr Glu Val Leu Thr Gly Gly Asp Ala Val Ser Ala Glu Ala 
755 760 765 

Val Arg Arg Val Lys Asp Ala Asn Pro Gly Leu Arg Val Arg Gin Leu 
770 775 780 



Tyr Gly Pro Thr Glu Val Thr Leu Cys Ala Thr Gin His Leu Leu Asp 
785 790 795 800 



Asp Gly Val Pro lie Gly Arg Pro Leu Asp Asn Thr Arg Val Tyr Val 
805 810 815 

Leu Asp Asp Leu Leu Gin Pro Val Pro Val Gly Val Thr Gly Glu Leu 
820 825 830 

Tyr Val Ala Gly Ala Gly Val Ala Arg Gly Tyr Ala Gly Met Pro Gly 
835 840 845 

Leu Thr Ala Glu Arg Phe Val Ala Asp Pro Phe Asn Thr Gly Gly Arg 
850 855 860 

Leu Tyr Arg Thr Gly Asp Leu Val Arg Trp Thr Asp Asp Gly Val Leu 
865 870 ' 875 880 

His Phe Ala Gly Arg Ala Asp Asp Gin Val Lys lie Arg Gly Tyr Arg 
885 890 895 

Val Glu Pro Gly Glu Val Glu Ala Val Leu Ala Gin His Pro Asp Val 
900 905 910 

Ser Gin Val Ala Val Val Val Arg Glu Asp Thr Pro Gly Asp Lys Arg 
915 920 925 

Leu Val Ala Tyr Val Val Gly Gly Asp lie Glu Ala Tyr Gly Gin Glu 
930 935 940 

Arg Leu Pro Gly Tyr Met Val Pro Ser Ala Phe Val His Leu Asp Ala 
945 950 955 960 

Leu Pro Leu Thr Ser Asn Gin Lys Val Asp Arg Ala Ala Leu Pro Ala 
965 970 975 

Pro Ser Met Glu Ser Gly Ala Gly Arg Ala Pro Ala Asp Ala Arg Glu 
980 985 990 

Glu Leu Val Cys Ala Ala Phe Ala Glu Val Leu Gly Leu Asp Arg Val 
995 1000 1005 

Gly Val Asp Asp Asp Phe Phe Ala Leu Gly Gly His Ser Leu Leu 
1010 1015 1020 

Ala Val Ser Leu Val Glu Asp Leu Arg Gin Arg Gly Leu His Val 
1025 1030 103'5 

Ser Val Arg Ala Leu Phe Ala Thr Pro Thr Pro Ala Ala Leu Ala 
1040 1045 1050 

Val Ser Thr Val Ala Ala Pro He Glu Val Pro Pro Asn Leu He 
1055 1060 1065 

Pro Gin Gly Gly Ala Arg Glu Leu Thr Pro Asp Met Leu Pro Leu 
1070 1075 1080 



Val Asp Leu Thr Gly Glu Glu Leu Ala Thr He Val Ala Ala Val 
1085 1090 1095 



Pro Gly Gly Ala Ala Asn lie Ala Asp lie Tyr Pro Leu Ala Pro 
1100 1105 1110 



Leu Gin Glu Gly He Phe Phe His His Leu Met Thr Glu Gly Asp 
1115 1120 1125 

Thr Ala Asp Val Tyr Ala Leu Pro Tyr Leu Leu Arg Val Gly Thr 
1130 1135 1140 

Arg Glu Gin Leu Asp Ala Phe Leu Gly Ala Leu Gin Gin Val Val 
1145 1150 1155 

Asp Arg His Asp Val Tyr Arg Thr Ala He Ala Trp Gin Asn Leu 
1160 1165 1170 

Arg Glu Pro Val Gin Val Val His Arg His Ala Thr Leu Pro Val 
1175 1180 1185 

Thr Glu Val Thr Pro Asp Gin Leu His Ala Ala Ala Thr Gly Gly 
1190 1195 1200 

Arg Leu Pro Leu Asp His Ala Pro Leu Leu Ser Val His He Ala 
1205 1210 1215 

Pro Glu Pro Asp Gly Gly Trp Leu Ala Leu Leu Arg Met His His 
1220 1225 1230 

Leu Val Gin Asp His Thr Ala Leu Asp He Val Leu Asp Glu He 
1235 1240 1245 

Arg Thr He Leu Ala Gly Ala Thr Asp His Leu Pro Pro Pro Val 
1250 1255 1260 

Pro Phe Arg Asn Phe Val Ala Arg Ser Arg Arg Gly Ala Ala Glu 
1265 1270 1275 

Ala Ala His Arg Asp Tyr Phe Thr Gly Leu Leu Gly Asp Val Thr 
1280 1285 1290 

Glu Thr Thr Ala Pro Tyr Gly Leu Thr Asp Val His Gly Glu His 
1295 1300 1305 

Ser Gly Val Arg Arg Gly Arg Leu Ala Val Ser Ala Gly Leu Ala 
1310 1315 1320 

Gly Arg Val Arg Glu Thr Ala Arg Asp Arg Gly Val Ser Pro Ala 
1325 1330 1335 

Thr Leu Phe His Leu Ala Trp Ala Arg Val Leu Ala Ala Val Ser 
1340 1345 1350 

Gly Arg Asp Asp Val Val Phe Gly Thr Val Leu Leu Gly Arg Met 
1355 1360 1365 

Asp Ala Gly Pro Gly Ala Asp Arg Val Pro Gly Leu Phe Met Asn 
1370 1375 . 1380 

Thr Leu Pro Val Arg Val Arg Leu Gly Gly Arg Thr Val Asp Glu 
1385 1390 1395 



Ala Leu His Gly Met Arg Ala Gin Leu Ala Asp Leu Leu Thr His 



1400 



1405 



1410 



Glu His Ala Pro Leu Val Leu Ala Gin Gin Ser Ala Gly Leu Pro 
1415 1420 1425 

Gly Gly Ser Pro Leu Phe Thr Ser Leu Phe Asn Tyr Arg His Asn 
1430 1435 1440 

Ala Thr Asp He Glu Arg Ser Gly Thr Gly lie Asp Gly Val Glu 
1445 1450 1455 

Ala Leu Pro Thr Gly Asp Pro Ser Asn Tyr Pro Leu Asp Val Ser 
1460 1465 1470 

Val Asn Gin Ser Pro Leu Gly Phe Glu Leu Val Val Glu Ala Thr 
1475 1480 1485 

Glu Pro Ala Asp Pro Asp Gin Leu Cys Arg Leu Leu His Ala Cys 
1490 1495 1500 

Leu Asp Asp Leu He Ala Ala Leu Asp Glu Gin Pro Gly Arg Ala 
1505 1510 1515 

Leu Gly Thr Leu Asp Val Val Ala Gly Arg Glu Arg Asp Leu Leu 
1520 1525 1530 

Leu Asp Gly Trp Asn Ala Thr Ala Val Pro Ala Gin Pro Ala Leu 
•1535 1540 1545 

Val Pro Glu Leu Phe Thr Ala Gin Ala Ala Arg Thr Pro Thr Trp 
1550 1555 1560 

Pro Ala Leu Val Thr Ala Gly Ala Glu Met Ser Tyr Ala Glu Leu 
1565 1570 1575 

Glu Glu Arg Ser Asn Arg Leu Ala Arg Trp Leu Ala Gly Arg Gly 
1580 1585 1590 

Val Gly Ala Asp Asp Arg Val Ala Leu Met Met Arg Arg Gly Pro 
1595 1600 1605 

Glu Leu Met Val Ala He Leu Ala Val Leu Lys Ala Gly Ala Ala 
1610 1615 1620 

Tyr Leu Pro Val Asp Pro Asp Leu Pro Arg Asp Arg Val Asp Tyr 
1625 1630 1635 

Leu Leu Ala Asp Ala Ala Pro Ala Phe Val Leu Ala Glu Arg Ala 
1640 1645 1650 

Thr Ala Pro Trp Val Pro Val Ala Gly Gly He Pro Val Leu Val 
1655 1660 1665 

Val Asp Ala Pro Ala Val Ala Ala Glu Val Ala Ala His Ser Gly 
1670 1675 1680 

Glu Ala Val Thr Asp Arg Asp Arg Arg Ala Ala Leu Arg Gly Gly 
1685 1690 1695 

His Leu Ala Tyr Val He Tyr Thr Ser Gly Ser Thr Gly Arg Pro 
1700 1705 1710 



Lys Gly Val Leu He Thr His Asp Gly Leu Ala Asn Leu Thr Leu 
1715 1720 1725 

Asp His Gly Arg Phe Gly Leu Giy Pro Gly Ala Arg Val Ala Gin 
1730 1735 1740 

Phe Ala Ser Pro Gly Phe Asp Met Phe Val Asp Glu Trp Ser Met 
1745 1750 1755 

Ala Leu Leu Ala Gly Ala Ala Leu Thr Phe Val Pro Pro Glu Arg 
1760 1765 1770 

Arg Leu Gly Ala Asp Leu Ala Ala Phe Leu Ala Glu Tyr Gly Val 
1775 1780 1785 

Thr His Ala Thr Leu Pro Pro Ala Val Val Gly Thr Pro Asp Gly 
1790 1795 1800 

Val Leu Pro Pro Ser Phe Val Leu Asp Val Gly Gly Asp Val Leu 
1805 1810 1815 

Pro Gly Asp Leu Ala Arg Arg Trp Leu Arg Asp Gly Arg Val Leu 
1820 1825 1830 

Phe Asn Ser Tyr Gly Pro Thr Glu Thr Thr Val Asn Ala Ala Thr 
1835 1840 1845 

Trp Arg Ala Glu Ala Gly Asp Trp Gly Ser Val Ala Pro He Gly 
1850 1855 I860 

Thr Pro Val Pro Asn Leu Arg Ala Tyr Val Leu Asp Gly Trp Leu 
1865 1870 1875 

Arg Pro Val Pro Val Gly Ala Asp Gly Glu Leu Tyr Val Ser Gly 
1880 1885 1890 

Ala Gly Leu Ala Arg Gly Tyr Leu Asn Arg Ala Gly Leu Thr Ala 
1895 1900 1905 

Glu Arg Phe Val Ala Cys Pro Phe Glu Pro Gly Glu Arg Met Tyr 
1910 1915 1920 

Arg Thr Gly Asp Val Val Arg Trp Thr Ala Glu Gly Arg Leu Val 
1925 1930 1935 

Phe Ala Gly Arg Ser Asp Asp Gin Val Lys He Arg Gly Phe Arg 
1940 1945 1950 

He Glu Pro Gly Glu Val Glu Ala Val Leu Ala Ala Gly Pro Gly 
1955 I960 1965 

Val Ser Gin Ala Ala Val He Val Arg Glu Asp Val Pro Gly Asp 
1970 1975 1980 

Lys Arg Leu Val Ala Tyr Val Val Gly Gly Asp Val Glu Ala Leu 
1985 1990 1995 

Arg Ser Tyr Ala Gin Gin Arg Leu Pro Gly Tyr Met Val Pro Ser 
2000 2005 2010 



Ala Phe Val Glu Leu Asp Arg Leu Pro Leu Thr Val Asn Gly Lys 
2015 2020 2025 



Leu Asp Arg Arg Ala Leu Pro Val Pro Asp Leu Ala Arg Gly Thr 
2030 2035 2040 

Gly Ser Gly Arg Pro Ala Gly Thr Pro Arg Glu Gin Leu Leu Cys 
2045 2050 2055 

Ala Gly Phe Ala Ala Val Leu Gly Val Asp Asp Val Gly Ala Asp 
2060 2065 2070 

Asp Asp Phe Phe Ala Leu Gly Gly His Ser Leu Leu Val Val Ser 
2075 2080 2085 

Leu Val Glu Trp Leu Arg Arg Arg Gly Val Ser Val Pro Val Arg 
2090 2095 2100 

Ala Leu Phe Thr Thr Pro Thr Pro Ala Gly Leu Ala Glu Ala Val 
2105 2110 2115 

Gly Asp Gly Ala Val Val Val Pro Pro Asn Leu lie Pro Glu Gly 
2120 2125 2130 

Ala Ala Glu Leu Thr Pro Glu Met Val Pro Leu Ala Asp Leu Thr 
2135 2140 2145 

Ser Glu Glu Leu Ala lie Val Val Ala Ser Val Pro Gly Gly Ala 
2150 2155 2150 

Ala Asn Val Ala Asp Val Tyr Pro Leu Ala Pro Leu Gin Glu Gly 
2165 2170 2175 

lie Phe Phe Pro Val Ala Thr Gly Pro Gin Cys Tyr Ala Thr Val 
2180 2185 2190 

Gly Ser Ser Leu Pro Asp Asp Gly Gly Ser Ala Pro Cys Ser Arg 
2195 2200 2205 

Phe Arg Arg Arg Cys Val Ser Thr Ser Val Val Trp Gin Gly Leu 
2210 2215 2220 

Arg Glu Pro Val Gin Val Val Trp Arg His Ala Arg Leu Pro Val 
2225 2230 2235 

Glu Glu Val Val Leu His Glu Gly Ala Asp Pro Val Glu Gin Met 
2240 2245 2250 

Met Ala Leu Ala Gly Gly Trp Met Asp Leu Thr Arg Ala Pro Leu 
2255 2260 2265 

lie Asp Val His He Ala Ala Gly Pro Gly Gly Asp Arg Trp Leu 
2270 2275 2280 

Ala Val Leu Arg He His His Leu Val Gin Asp His Thr Ala Leu 
2285 2290 2295 

Glu Thr Leu Leu Asp Glu Leu Gin Ser Phe Leu Glu Gly Arg Gly 
2300 2305 2310 



Gly Glu Leu Ala Glu Pro Val Pro Phe Arg Glu Phe Val Ala Gin 



2315 



2320 



2325 



Ala Arg Leu Gly Val Pro Arg Glu Glu His Glu Arg Tyr Phe Ala 
2330 2335 2340 

Glu Leu Leu Gly Asp lie Thr Glu Thr Thr Ala Pro Tyr Asp Leu 
2345 2350 2355 

Thr Asp Val His Gly Asp Gly Thr Gly Tyr Asp His Gly Ala Leu 
2360 2365 2370 

Pro Leu Asp Ala Thr Val Ala Ala Arg Val Arg Glu Ala Ala Arg 
2375 2380 2385 

Thr Leu Gly Val Ser Pro Ala Thr Leu Phe His Leu Ala Trp Ala 
2390 2395 2400 

Arg Val Leu Gly Thr Leu Ala Gly Arg Asp Asp Val Val Phe Gly 
2405 2410 2415 

Thr Val Leu Phe Gly Arg Met Asn Ser Gly Ala Gly Ala Asp Arg 
2420 2425 2430 

Val Ser Gly Leu Phe lie Asn Thr Leu Pro Val Arg Val Arg Leu 
2435 2440 2445 

Gly Ala Pro Thr Gly Asp Ala Leu Gly Asp Leu Arg Asp Gin Leu 
2450 2455 2460 

Ala Glu Leu Leu Val His Glu His Ala Ser Leu Ala Ser Ala Gin 
2465 2470 2475 

Lys Ala Ser Gly Leu Pro Gly Gly Ser Pro Leu Phe Thr Ser lie 
2480 2485 2490 

Phe Asn Tyr Arg His Asn Gin Val Ser Ala Glu Arg Glu Thr Ala 
2495 2500 2505 

Ala Leu Pro Gly lie Arg Val Leu Ala Ala Arg Asp Ser Thr Asn 
2510 2515 2520 

Tyr Pro Leu Thr Val Ala Val Asp Asp Asp Gly His Gly Phe Thr 
2525 2530 2535 

Leu Val Val Glu Val Ala Ser Thr Val Asp Ala Ala Gly Val Cys 
2540 2545 2550 

Glu Leu Leu His Thr Ala Val Asp Asn Leu lie Ala Ala Leu Thr 
2555 2560 2565 

Asp .Arg Pro Gly Gly Pro Leu Ala Glu Val Asp lie Leu Glu Arg 
2570 2575 2580 

Gly Leu Arg Asp Arg Leu Leu Thr Ala Trp Asn Glu Ala Arg Glu 
2585 2590 2595 

Pro Ala Pro Pro Val Thr Leu Pro Asp Leu Phe Asp Arg Gin Ala 
2600 2605 2610 

Arg Arg Thr Pro Glu Ala Val Ala Leu Thr Ala Asp Gly Val Ser 
2615 2620 2625 



Leu Thr Tyr Arg Glu Leu Ser Glu Arg Ala Asn Arg lie Ala Arg 
2630 . 2635 2640 

Leu Leu Thr Ser Arg Gly He Gly Pro Glu Ser Leu Val Gly Val 
2645 2650 2655 

Val Leu Pro Arg Ser Ala Asp Leu Val Val Ala Leu Leu Gly Val 
2660 2665 2670 

Leu Gin Ala Gly Ala Ala Tyr Val Pro Val Asp Ala Asp Tyr Pro 
2675 2680 2685 

Ala Glu Arg He Gly Tyr He Leu Gly Asp Ala Gly Ala Val Cys 
2690 2695 2700 

Val Leu Thr Val Asp Ala Thr Ala Gly Ala Val Pro Pro Gly Val 
2705 2710 2715 

Pro Lys Leu Val Leu Asp His Pro Glu Thr Val Thr Ala Leu Ala 
2720 2725 2730 

Ala Cys Asp Thr Ala Pro Leu Gly Glu Ala Glu Arg Ala Gly Glu 
2735 2740 2745 

Leu Leu Pro Glu His Pro Ala Tyr Val He Tyr Thr Ser Gly Ser 
2750 2755 2760 

Thr Gly Thr Pro Lys Gly Val Leu He Pro His Arg Asn Val Val 
2765 2770 2775 

Glu Leu Phe Ala Ala Thr Arg Gly Ser Phe His Phe Gly Glu Gly 
2780 2785 2790 

Asp Val Trp Ser Trp Phe His Ser Val Ala Phe Asp Phe Ser Val 
2795 2800 2805 

Trp Glu Leu Trp Gly Ala Leu Leu His Gly Gly Arg Val Val Met 
2810 2815 2820 

Val Pro Phe Ala Val Ser Arg Ser Pro Arg Asp Phe Trp Glu Leu 
2825 2830 2835 

Leu Val Arg Glu Arg Val Thr Val Leu Ser Gin Thr Pro Ser Ala 
2840 2845 2850 

Phe Tyr Gin Leu Ala Ala Ala Ala Asp Asp Thr Pro Asp Ala Leu 
2855 2860 2865 

Arg Val Val Val Phe Gly Gly Glu Ala Leu Asp Pro Gly Arg Leu 
2870 2875 2880 

Ala Gly Trp Arg Glu Arg Arg Pro Asp Gly Pro Arg Leu Val Asn 
2885 2890 2895 

Met Tyr Gly He Thr Glu Thr Thr Val His Val Thr His Gin Asp 
2900 2905 2910 

Leu Ala Pro Ala Asp Thr Thr Gly Ser Pro He Gly Arg Gly He 
2915 2920 2925 



Pro Gly Leu Ser Val Tyr Val Leu Asp Glu Ala Leu Arg Pro Val 
2930 2935 2940 



Pro Pro Gly Val Ala Gly Glu Val Tyr Val Ala Gly Arg Gin Leu 
2945 2950 2955 

Ala Arg Ala Tyr Leu Gly Arg Ala Ala Leu Thr Gly Thr Arg Phe 
2960 2965 2970 

Val Ala Cys Pro Phe Leu Pro Ala Gly Glu Arg Met Tyr Arg Thr 
2975 2980 2985 

Gly Asp Arg Ala Arg Ti;p Ser Arg Gly Arg Leu Gin Phe Ala Gly 
2990 2995 3000 

Arg Thr Asp Asp Gin Val Gin lie Arg Gly Phe Arg lie Glu Pro 
3005 3010 3015 

Gly Glu Val Gin Ala Val Val Ala Ala His Pro Glu He Ala Ala 
3020 3025 3030 

Ala Ala Val Val Val Arg Glu Asp Val Pro Gly Asp Pro Arg Leu 
3035 3040 3045 

Thr Ala Tyr Val Val Pro Ala Gly Pro Arg Thr Ala Pro Ala Ala 
3050 3055 3050 

Val Ala Glu Thr Val Arg Arg Phe Ala Ala Asp Arg Leu Pro Ala 
3055 3070 3075 

Tyr Met Leu Pro Ser Ala Val Val Val Leu Asp Ala Leu Pro Leu 
3080 3085 3090 

Thr Asp His Gly Lys Leu Asp Arg Arg Ala Leu Pro Ala Pro Gin 
3095 3100 3105 

His Thr Gly Ala Ala Ser Gly Arg Ala Pro Ala Thr Val Ala Glu 
3110 3115 3120 

Glu Val Leu Cys Ala Ala Phe Ala Glu Val Leu Gly Val Glu Arg 
3125 3130 3135 

Val Gly Val Asp Asp Asp Phe Phe Ala Leu Gly Gly His Ser Leu 
3140 3145 3150 

Leu He Val Ser Leu Val Glu Arg Val Arg Arg Ala Gly Leu Ala 
3155 3160 3165 

He Pro Val Arg Ala Leu Phe Arg Ser Ala Thr Pro Ala Gly Leu 
3170 3175 3180 

Ala Ala Leu Ala Arg Pro Tyr Arg Val Asp He Pro Pro Asn Leu 
3185 3190 ^ 3195 

Val Pro Asp Gly Ala Arg Glu He Thr Pro Asp Met Leu Thr Leu 
3200 3205 3210 

Ala Ala Leu Thr Glu Ala Glu He Ala Thr Val Leu Ala Thr Val 
3215 3220 3225 



Pro Gly Gly Ala Val Asn Val Ala Asp He Tyr Pro Leu Ala Pro 



3230 

Leu Gin Glu Gly He 
3245 

Arg Ala Asp Ala Tyr 
3260 

Ala Glu Arg Leu Asp 
3275 

Asp Arg Asn Asp He 
3290 

Arg Glu Pro Val Gin 
3305 

Glu Glu Val Ala Leu 
3320 

Leu Ala Ala Ala Gly 
3335 

He Arg Ala His Val 
3350 

Leu Leu Arg He His 
3365 

Val Val Leu Gly Glu 
3380 

Glu Leu Ala Glu Pro 
3395 

Arg Leu Gly Val Pro 
3410 ^ 

Leu Leu Gly Asp Val 
3425 

Asp Val His Gly Asp 
3440 

Val Ala Glu Ala Leu 
3455 

Leu Gly Val Ser Pro 
3470 

Val Leu Ser Val He 
3485 

lie Leu Phe Gly Arg 
3500 

Pro Gly Leu Phe He 
3515 

Gly Thr Ser Val Gly 
3530 



3235 

Phe Phe His His Leu 
3250 

Ala Met Pro Tyr Val 
3265 

Val Leu Leu Gly Ala 
3280 

Tyr Arg Thr Gly Val 
3295 

Val Val Trp Arg Ser 
3310 

Asp Gly Gly His Asp 
3325 

Glu Glu Phe Asp Leu 
3340 

Ala Ala His Pro Asp 
3355 

His Leu Val Gin Asp 
3370 

Leu Arg Ala Phe Leu 
3385 

Val Pro Phe Arg Glu 
3400 

Arg Glu Glu His Glu 
3415 

Thr Glu Thr Thr Ala 
3430 

Gly Ser Arg Ala Val 
3445 

Ala Val Arg Val Arg 
3460 

Ala Thr Val Phe His 
3475 

Ala Gly Arg Asp Asp 
3490 

Met Asn Ser Gly Ala 
3505 

Asn Thr Leu Pro Val 
3520 

Glu Ala Leu Thr Ala 
3535 



3240 

Met Ala Asp Ala Gly 
3255 

Leu His Leu Asp Thr 
3270 

Leu Gin Arg Val He 
3285 

Val Ser Ala Gly Leu 
3300 

Ala Val Leu Pro Val 
3315 

Pro Val Glu Gin Leu 
3330 

Thr Arg Ala Pro Leu 
3345 

Gly Gly Arg Leu Leu 
3360 

His Thr Thr Phe Asp 
3375 

Glu Gly Arg Gly Gly 
3390 

Phe Val Ala Gin Ala 
3405 

Arg Tyr Phe Ala Glu 
3420 

Pro Tyr Gly Leu Thr 
3435 

Gin Val Ser Leu Pro 
3450 

Glu Val Ala Arg Thr 
3465 

Leu Ala Trp Ala Arg 
3480 

Val Val Phe Gly Thr 
3495 

Ala Ala Glu Arg Val 
3510 

Arg Val Arg Leu Asn 
3525 

Leu Arg Asp Gin Met 
3540 



Ala Glu Leu Met Ala His Glu His Ala Pro Leu Ala Leu Ala Gin 
3545 3550 3555 

Arg Ala Gly Gly Val Pro Ala Gly Ser Pro Leu Phe Thr Ser Leu 
3560 3565 3570 

Phe Asn Tyr Arg His Asn Val Ala Gly Gly Gly Asp Gly Gly Ala 
3575 3580 3585 

Leu Glu Gly Val Thr Pro Val Leu His Arg Asp Thr Thr Asn Tyr 
3590 3595 3600 

Pro Val Val Val Ser Val Asp Asp Asp Gly Thr Ser Phe Asp Leu 
3605 3610 3615 

Val Val Glu Ala Val Ala Pro Ala Glu Ala Gly Arg Val Gly Arg 
3620 3625 3630 

Leu Met His Glu Cys Leu Ala Glu Leu Val Gly Ala Leu Ala Gly 
3635 3640 , 3645 

Ala Pro Glu Thr Pro Leu Ser Arg Val Arg Val lie Asp Glu Ala 
3650 3655 3660 

Glu He Glu Arg Val Val His Ser Trp Asn Asp Thr Ala Arg Pro 
3665 3670 3675 

Val Val Glu Ser Ser Val Pro Ala Leu Phe Ala Glu Gin Val Ala 
3680 3685 3690 

Ala Ala Pro Asp Ala Thr Ala Val Val Gly Glu Gly Val Ser Trp 
3695 3700 3705 

Ser Tyr Arg Glu Leu Asp Ala Arg Ser Asp Ala Leu Ala Arg Ser 
3710 3715 3720 

Leu Val Ala Ala Gly Val Gly Val Glu Ser Pro Val Val Val Ala 
3725 3730 3735 

Leu Glu Arg Ser Pro Glu Val Leu Ser Ala Phe Leu Ala Val Ala 
3740 3745 3750 

Lys Ala Gly Gly Val Phe Val Pro Val Asp Leu Ser Trp Pro Gin 
3755 3760 3765 

Ala Arg He Asp Ala Val Val Ala Asp Cys Ala Ala Arg Val Ala 
3770 3775 3780 

Val Ala Asp Arg Pro Met Ser Gly Leu Thr Val Val Pro Ala Asp 
3785 3790 3795 

Gin Val Gly Asp Ser Ala Val Val Leu Pro Ala Gly Pro Val Pro 
3800 3805 3810 

Gly Ala Ala Val Tyr Arg Met Tyr Thr Ser Gly Ser Thr Gly Arg 
3815 3820 3825 

Pro Lys Gly Val Val Thr Thr His Gin Asn Leu Val Asp Leu Ala 
3830 3835 3840 



Thr Asp Thr Cys Trp Gly Pro Thr Pro Arg Val Leu Phe His Ala 
3845 3850 3855 

Pro His Ala Phe Asp Ala Ser Ser Tyr Glu He Trp Val Pro Leu 
3860 3865 3870 

Leu Asn Gly Gly Thr Val Val Val Ala Pro Gin Arg Ser He Asp 
3875 3880 3885 

Ala Thr Val Leu Arg Asp Leu He Arg Gly His Glu Leu Thr His 
3890 3895 3900 

Val His Val Thr Ala Gly Leu Leu Arg Val Leu Asp Pro Ser Cys 
3905 3910 3915 

Phe Ala Gly Leu Thr Glu Val Leu Thr Gly Gly Asp Ala Val Ser 
3920 3925 3930 

Ala Glu Ala Val Arg Arg Val Arg Glu Ala Asn Pro Gly Leu Arg 
3935 3940 3945 

Val Arg Gin Leu Tyr Gly Pro Thr Glu Val Thr Leu Cys Ala Thr 
3950 3955 3960 

Gin His Leu Leu Val Asp Gly Val Pro He Gly Arg Pro Leu Asp 
3965 3970 3975 

Asn Thr Arg Val Tyr Val Leu Asp Asp Leu Leu Gin Pro Val Pro 
3980 3985 3990 

Val Gly Val Thr Gly Glu Leu Tyr Val Ala Gly Ala Gly Leu Ala 
3995 4000 4005 

Arg Gly Tyr Ala Gly Met Pro Gly Leu Thr Ala Glu Arg Phe Val 
4010 4015 4020 

Ala Asp Pro Phe Ser Val Gly Gly Arg Leu Tyr Arg Thr Gly Asp 
4025 4030 4035 

Leu Val Arg Trp Thr Asp Asp Gly Val Leu His Phe Ala Gly Arg 
4040 4045 4050 

Ala Asp Asp Gin Val Lys He Arg Gly Tyr Arg Val Glu Pro Gly 
4055 4060 4065 

Glu Val Glu Ala Val Leu Ala Gin His Pro Asp Val Ser Gin Val 
4070 ^ 4075 4080 

Ala Val Val Val Arg Glu Asp Thr Pro Gly Asp Lys Arg Leu Val 
4085 4090 4095 

Ala Tyr Val Val Gly Gly Asp Val Glu Ala Tyr Ala Gin Glu Arg 
4100 4105 4110 

Leu Pro Gly Tyr Leu Val Pro Ser Ala Phe Val His Leu Asp Ala 
4115 4120 4125 

Leu Pro Leu Thr Ser Asn Gin Lys Val Asp Arg Ala Ala Leu Pro 
4130 4135 4140 

Ala Pro Ser Val Glu Ser Gly Val Gly Arg Ala Pro Ala Asp Ala 



4145 4150 4155 

Arg Glu Glu Leu Met Cys Ala Ala Phe Ala Glu Val Leu Asp Leu 
4160 4165 4170 

Asp Arg Val Gly Val Asp Asp Asp Phe Phe Ala Leu Gly Gly His 
4175 4180 4185 

Ser Leu Leu Val Val Arg Leu Val Gly Arg He Arg Gin Val Phe 
4190 4195 4200 

Gly Val Glu Val Ser Ala Arg Leu Val Phe Asp Ala Arg Thr Pro 
4205 4210 4215 

Ala Gly Val Val Ala Arg Leu Ser Glu Gly Gly Thr Ala Arg Glu 
4220 4225 4230 

Ala Val Arg Ala Arg Val Arg Pro Ala Arg Val Pro Leu Ser Phe 
4235 4240 4245 

Ala Gin Arg Arg Leu Trp Phe Leu Ser Gin Leu Glu Gly Pro Ser 
4250 4255 4260 

Ala Thr Tyr Asn He Pro Val Ala Leu Arg Leu Asp Gly Pro Leu 
4265 4270 4275 

Asp Arg Asp Ala Leu Thr Ala Ala Leu His Asp Val Val Ala Arg 
4280 4285 4290 

His Glu Val Leu Arg Thr Val Phe Thr Val Ala Asp Gly Glu Pro 
4295 4300 4305 

Trp Gin Gin He Leu Asp Asp Pro Gin Val Ser Val Pro Val Val 
4310 4315 4320 

Glu Val Thr Pro Asp Arg Leu Pro Glu Ala Val Ala Val Ala Ala 
4325 4330 4335 

Gly His Arg Phe Asp Leu Gly Arg Glu Leu Pro Leu Arg Ala Val 
4340 4345 4350 

Leu Leu Ala Thr Gly Asp Asp Val His Val Leu Val Leu Val Val 
4355 4360 4365 

His His lie Ala Ala Asp Gly Trp Ser Met Arg Pro Leu Ala Arg 
4370 4375 4380 

Asp Leu Ala Ala Ala Tyr Ala Ala Arg He Asp Ala Thr Ala Pro 
4385 4390 4395 

Ala Leu Gly Ala Leu Pro Val Gin Tyr Ala Asp Tyr Ala Leu Trp 
4400 4405 4410 

Gin Arg Asp Val Leu Gly Ser Glu His Asp Pro Asp Ser Val He 
4415 4420 4425 

Ser Gin Gin Val Ala Tyr Trp Arg Arg Gin Leu Ala Gly Val Pro 
4430 4435 4440 

Glu Glu Leu Asp Leu Pro Val Asp Arg Ala Arg Pro Ala Glu Ala 
4445 4450 4455 



Ser His Arg Gly His Thr Val Glu Phe Ala Val Pro Pro Ala Val 
4460 4465 4470 



His His Gin Leu Ala Glu Leu Ala Arg Arg Asn Gly Val Thr Val 
4475 4480 4485 

Phe Met Thr Val Gin Thr Ala Leu Ala Val Leu Leu Ser Lys Leu 
4490 4495 4500 

Gly Ala Gly Thr Asp He Pro He Gly Val Ala Val Ala Gly Arg 
4505 4510 4515 

Thr Asp Pro Thr Leu Asp Asn Leu He Gly Phe Phe Val Asn Thr 
4520 4525 4530 

Leu Val Leu Arg Thr Asp Leu Thr Gly Asn Pro Thr He Thr Asp 
4535 4540 4545 

Leu Leu His Arg Thr Arg Asp Thr Thr Leu His Ala Phe Thr His 
4550 4555 , 4560 

Gin Asp Val Pro Phe Glu Lys Leu Val Glu Asp Leu Ala Pro Thr 
4565 4570 4575 

Arg Ser Leu Ala Arg His Pro Leu Phe Gin Val Met Met Thr Leu 
4580 4585 4590 

Gin Ser Ala Ser Ala Asp Glu Glu Pro Leu Ala Leu Ala Gly Leu 
4595 4600 4605 

Arg Val Thr Asp Leu Pro Ala Gly Glu Thr Pro Ala Lys Val Asp 
4610 4615 4620 

Leu Asp Leu Thr Leu His Glu Val Ala Gly Arg Asp Gly Met His 
4625 4630 4635 

Ala Thr Leu Leu Gly Ala Ala Asp Leu Phe Glu Gin Glu Thr Val 
4640 4645 4650 

Arg Ala Leu Ala Asp Arg Leu Leu Arg Thr Leu Glu Ala Met Ala 
4655 4660 4665 

Ala Ala Pro Asp Asp Arg Leu Asp Arg He Glu Val Leu Ser Pro 
4670 4675 4680 

Gly Glu Arg Ser Arg Leu Leu Val Glu Trp Asn Asp Thr Ala Arg 
4685 4690 4695 

Pro Val Val Glu Ser Ser Val Pro Ala Leu Phe Ala Glu Gin Val 
4700 4705 4710 

Ala Ala Ala Pro Asp Ala Val Ala Val Val Gly Glu Gly Val Ser 
4715 4720 4725 

Trp Thr Tyr Arg Glu Leu Asp Ala Arg Ser Asp Ala Leu Ala Arg 
4730 4735 4740 



Ser Leu Val Ala Ala Gly Val Gly Val Glu Ser Pro Val Val Val 
4745 4750 4755 



Ala Leu Glu Arg Ser Pro Glu Val Leu Ser Ala Phe Leu Ala Val 
4760 4765 4770 



Ala Lys Ala Gly Gly Val Phe Val Pro Val Asp Leu Ser Trp Pro 
4775 4780 4785 

Gin Ala Arg Val Asp Ala Val Val Ala Asp Cys Gly Ala Arg lie 
4790 4795 4800 

Ala Val Ala Asp Arg Pro Met Ser Gly Leu Thr Val Val Ser Ala 
4805 4810 4815 

Gly Leu Gly Gly Asp Ser Ala Val Val Ser Gly Asp Leu Thr Ala 
4820 4825 4830 

Asp Arg Ala Val Val Leu Pro Ala Gly Pro Val Pro Gly Ala Ala 
4835 4840 4845 

Val Tyr Arg Met Tyr Thr Ser Gly Ser Thr Gly Arg Pro Lys Gly 
4850 . 4855 4860 

Val Val Thr Thr His Gin Asn Leu Val Asp Leu Ala Thr Asp Thr 
4865 4870 4875 

Cys Trp Gly Pro Thr Pro Arg Val Leu Phe His Ala Pro His Ala 
4880 4885 4890 

Phe Asp Ala Ser Ser Tyr Glu lie Trp Val Pro Leu Leu Asn Gly 
4895 4900 4905 

Gly Thr Val Val Val Ala Pro Arg Arg Ser lie Asp Ala Thr Val 
4910 4915 4920 

Leu Arg Asp Leu lie Gly Ala His Glu Leu Thr His Val His Val 
4925 4930 4935 

Thr Ala Gly Leu Leu Arg Val Leu Asp Pro Ser Cys Phe Ala Gly 
4940 4945 4950 

Leu Thr Glu Val Leu Thr Gly Gly Asp Ala Val Ser Ala Glu Ala 
4955 4960 4965 

Val Arg Arg Val Lys Asp Ala Asn Pro Gly Leu Arg Val Arg Gin 
4970 4975 4980 

Leu Tyr Gly Pro Thr Glu Val Thr Leu Cys Ala Thr Gin His Leu 
4985 4990 4995 



Leu 



<210> 15 

<211> 4999 

<212> PRT 

<213> Ac tinop lanes sp , 

<400> 15 



Met lie Pro Leu Ser Phe Ala Gin Arg Arg Leu Trp Phe Leu Gly Arg 
15 10 15 



Leu Glu Gly Pro Ser Ala Thr Tyr Asn lie Pro Leu Val Leu Gly Leu 
20 25 30 

Thr Gly Thr Val Asp Ala Ala Ala Leu Glu Thr Ala Leu Arg Asp Val 
35 40 45 

Leu Glu Arg His Glu Val Leu Arg Thr Val Tyr Pro Asp Ala Gly Gly 
50 55 60 

Glu Pro His Gin Arg lie Leu Pro Leu Gly Glu Thr Gly Phe Gly Leu 
65 70 75 80 

Arg Val Ala Glu Val Thr Asp Gly Glu Leu Asp Ala Ala Val Ala Asp 
85 90 95 

Ala Thr Gly His Ala Phe Asp Leu Ala Thr Glu lie Pro Val Arg Ala 
100 105 110 

Ser Leu Leu Thr Val Glu Pro Gly Arg His Val Leu Ala Leu Val Leu 
115 120 125 

His His He Ala Ala Asp Gly Trp Ser Met Gly Pro Leu Leu Arg Asp 
130 135 140 

Leu Ser Thr Ala Tyr Thr Ala Arg Leu Ala Gly Gly Glu Pro Ala Trp 
145 150 155 160 

Ser Pro Leu Pro Val Gin Tyr Ala Asp Tyr Ala Leu Trp Gin Gin Glu 
165 170 175 

Val Leu Gly Ala Gly Asp Asp Pro Glu Ser Leu Leu Arg Glu Gin Val 
180 185 190 

Gly Tyr Trp Arg Ser Ala Leu Ala Gly Ala Pro Glu Glu Leu Arg Leu 
195 200 205 

Pro Ala Asp His Arg Arg Pro Pro Val Ser Ser Ser Arg Ala His Met 
210 215 220 

Ala Glu Phe Ala Val Pro Ala Ala Ala His Gly Asp Leu Thr Ala Leu 
225 230 235 240 

Thr Arg Glu Leu Gly Ala Thr Leu Phe Met Ala Val His Ala Ala Thr 
245 250 255 

Ala Met Val Leu Ser Gly Leu Gly Ala Gly Asp Asp Leu Pro He Gly 
260 265 270 

Thr Val Val Ala Gly Arg Thr Asp Ala Gly Leu Asp Asp Leu Val Gly 
275 280 285 

Cys Phe Val Asn Asn Leu Val He Arg Ala Asp Leu Thr Gly Asp Pro 
290 295 300 

Thr Phe Ala Asp Leu Leu Arg Gin Val Arg Glu Arg Ala Leu Asp Ala 
305 310 315 320 

Tyr Gly His Gin Asp Val Pro Phe Glu Lys Leu Val Glu Glu Leu Ala 
325 330 335 



Pro Ser Arg Ser Leu Ser Arg His Pro Leu Phe Gin Val Ala Val Ala 



340 345 350 

Val Glu Thr Asp Asp Leu lie Gly Gly Arg Gly Gly Gly Pro Ala Leu 
355 360 365 

Arg Leu Pro Gly Leu Gly He Glu Val Leu Pro Gly Glu Pro Ser Ala 
370 375 380 

Arg Asp Leu Asp Leu Asp Leu Val Val Arg Glu Thr Phe Asp Ala Glu 
385 390 395 400 

Gly Arg Pro Ala Gly Leu Thr Gly Ala Leu He Gly Ala Ala Gly Leu 
405 410 415 

Phe Asp Ala Ala Ser Val Glu Arg Leu Ala Ala Leu Leu Ala Arg Ala 
420 425 430 

Leu Glu Ala Leu Ala Ala Asp Pro Arg Thr Arg Ala Gly Asp Leu Asp 
435 440 445 

Leu Leu Ser Pro Ala Asp Arg Arg Leu He Leu Arg Gly Trp Asn Asp 
450 455 460 

Thr Ala Ala Pro Ala Pro Ala Gly Leu Val Pro Asp Leu Phe Ala Ala 
465 470 475 480 

Gin Ala Ala Arg Thr Pro Asp Ala Val Ala Val Ala Gly Pro Asp Arg 
485 490 * 495 

Glu Leu Thr Tyr Ala Glu Leu Asp Glu Arg Ser Gly Arg Leu Ala Arg 
500 505 510 

Trp Leu He Arg Arg Gly Val Ala Ala Asp Thr Arg Val Ala Leu Val 
515 520 525 

Leu Glu Arg Ser Ala Glu Leu Pro Val Ala He Leu Ala Val Leu Lys 
530 535 540 

Ala Gly Gly Ala Tyr Leu Pro He Asp Pro Ala Gin Pro Pro Arg Arg 
545 550 555 560 

He Ala Asp He Val Ala Asp Ala Ala Pro Ala Leu Val Leu Ala Gin 
565 570 575 

Ala Ser Thr Ala Asp Val Val Ala Asp Ala Ser Pro Ala Leu Val Leu 
580 585 590 

Ala Pro Ala Ser Asp Gly Val Pro Thr Gly Ala Val Pro Val His Leu 
595 600 605 

Leu Asp Ser Pro Ala Val Arg Asp Glu Val Ala Gin Cys Pro Ala Gly 
610 615 620 

Ala Val Thr Asp Ala Asp Arg Arg Gly Val Leu Leu Gly Gly His Ala 
625 630 635 640 

Ala Tyr Val He Tyr Thr Ser Gly Ser Thr Gly Arg Pro Lys Gly Val 
645 650 655 

Val Val Ser His Asp Ala Phe Ala Asn Leu Val Leu Asp Gin Arg Arg 
660 665 670 



Leu Gly lie Gly 
675 

Phe Asp Met Phe 
690 

Ala Leu Val lie 
705 

Ala Phe Leu Thr 



Val Val Ala Thr 
740 

Asp lie Gly Gly 
755 

Arg Asp Gly Arg 

770 

Val Asn Ala Ala 
785 

Thr Pro lie Gly 



Gly Arg Leu Arg 
820 

Ser Gly Ala Gly 
835 

Ala Gly Ser Phe 
850 

Arg Thr Gly Asp 
865 

Ala Gly Arg Ala 



Pro Gly Glu Val 
900 

Val Ala Val lie 
915 

Ala Tyr Val Val 
930 

Gin Arg Leu Pro 
945 

Glu Leu Pro Leu 



Val Pro Asp Tyr 
980 



Pro Gly Ser Arg 
680 

Val Asp Glu Trp 
695 

Val Pro Pro Glu 
710 

Glu Arg Gly Val 
725 

Leu Pro Glu Glu 



Asp Ala Leu Pro 
760 

Trp Leu Gly Asn 
775 

Thr Trp Arg Cys 
790 

Arg Pro Val Ala 
805 

Pro Val Pro Val 



Leu Ala Arg Gly 
840 

Val Ala Cys Pro 
855 

lie Val Arg Trp 
870 

Asp Asp Gin Ala 
885 

Glu Ala Val Leu 



Val Arg Glu Asp 
920 

Gly Gly Asp Val 
935 

Gly Tyr Leu Val 
950 

Thr Pro Ser Ala 
965 

Gly Arg Asp Ala 



Val Ala Gin Phe 



Ser Met Ala Leu 
700 

Arg Arg Leu Gly 
715 

Thr His Ala Thr 
730 

Ser Leu Pro Arg 
745 

Asp Asp Leu Ala 



Ser Tyr Gly Pro 
780 

Glu Pro Gly Thr 
795 

Asn Leu Arg Ala 
810 

Gly Val Glu Gly 
825 

Tyr Leu Asn Arg 



Phe Glu Pro Gly 
860 

Asp Ala Arg Gly 
875 

Lys lie Arg Gly 
890 

Ala Ala Gly Pro 
905 

Val Pro Gly Asp 



Glu Thr Leu Arg 
940 

Pro Ser Ala lie 
955 

Lys Val Asp Arg 
970 

Gly Gly Gly Arg 
985 



Ala Ser Pro Gly 
685 

Leu Ala Gly Ala 



Ala Asp Leu Ala 
720 

Leu Pro Pro Ala 
735 

Ser Phe Val Leu 
750 

Arg Arg Trp Leu 
765 

Thr Glu Thr Thr 



Trp Glu Gly Ala 
800 

Tyr Val Leu Asp 
815 

Glu Leu Tyr Val 
830 

Ala Gly Leu Thr 
845 

Glu Arg Met Tyr 



Arg Leu Val Tyr 
880 

Phe Arg Val Glu 
895 

Gly Val Asn Gin 
910 

Lys Arg Leu Val 
925 

Ser Tyr Ala Gin 



Val Ala Leu Ala 
960 

Arg Ala Leu Pro 
975 

Ala Pro Ala Asn 
990 



Ala Arg Glu Glu Val Leu Cys Arg Ala Phe Ala Glu Val Leu Gly Val 
995 1000 1005 

Glu Arg Val Gly Val Glu Asp Asp Phe Phe Ala Leu Gly Gly His 
1010 1015 1020 

Ser Leu Leu Val Val Ser Leu Val Glu Arg Leu Arg Arg Gin Gly 
1025 1030 1035 

lie Ser Val Pro Val Arg Ala Leu Phe Thr Thr Pro Thr Pro Ala 
1040 1045 1050 

Gly Leu Ala Glu Ala Val Gly Asp Gly Ala Val Val Val Pro Pro 
1055 1060 1065 

Asn Leu lie Pro Glu Gly Ala Ala Glu Leu Thr Pro Glu Met Leu 
1070 1075 1080 

Pro Leu Ala Asp Leu Thr Ala Asp Glu Leu Ala Val Val Val Asp 
1085 1090 1095 

Ser Val Pro Gly Gly Ala Ala Asn lie Ala Asp Val Tyr Pro Leu 
1100 1105 1110 

Ala Pro Leu Gin Glu Gly lie Phe Phe His His Met Met Ala Asp 
1115 1120 1125 

Arg Asp Ser Ala Asp Val Tyr Val Thr Pro Thr Val Val Glu Phe 
1130 1135 1140 

Asp Ser Arg Asp Arg Leu Asp Gly Phe Leu Ala Ala Leu Gin Gin 
1145 1150 1155 

Val Val Asp Arg Thr Asp Val Tyr Arg Thr Ser Val Val Trp Gin 
1160 1165 1170 

Gly Leu Arg Glu Pro Val Gin Val Val Trp Arg His Ala Arg Leu 
1175 1180 1185 

Pro Val Asp Glu Val Val Leu Arg Asp Asp Leu Asp Pro Val Glu 
1190 1195 1200 

Gin Leu Asn Ala Leu Gly Thr Ala Trp Met Asp Leu Ser Glu Ala 
1205 1210 1215 

Pro Leu Val Gin Ala Val Val Ala Ala Arg Pro Gly Asp Pro Gin 
1220 1225 1230 

Arg Trp Leu Ala Val Leu Arg lie His His Leu Val Gin Asp His 
1235 1240 1245 

Thr Ala Leu Asp lie Leu Leu Glu Glu Leu Ala Ala Tyr Leu Ala 
1250 1255 1260 

Gly Arg Gly Gly Asp Leu Pro Glu Pro Val Pro Phe Arg Glu Phe 
1265 1270 1275 

Val Ala His Thr Arg Leu Gly Val Pro Arg Glu Glu His Glu Arg 
1280 1285 1290 



Tyr Phe Ala Gly Leu Leu Gly Asp Val Thr Glu Thr Thr Ala Pro 



1295 1300 1305 

Tyr Gly Leu Leu Asp Val His Ser Gly Gly Leu Ala Ser Ala Gin 
1310 1315 . 1320 

Ala His Leu Arg Leu Asp Gly Pro Leu Gly Arg Arg Val Ala Ala 
1325 1330 1335 

Phe Ala Arg Glu His Gly Val Ser Pro Ala Thr Leu Phe His Leu 
1340 1345 1350 

Ala Trp Ala Arg Val Leu Gly Thr Leu Ala Gly Arg Asp Asp Val 
1355 1360 1365 

Val Phe Gly Thr Val Leu Phe Gly Arg Met Asn Ser Gly Ala Gly 
1370 1375 1380 

Ala Asp Arg Val Pro Gly Leu Phe He Asn Thr Leu Pro Val Arg 
1385 1390 1395 

Val Arg Leu Gly Ala Pro Val Gly Asp Ala Leu Asp Gly Leu Arg 
1400 1405 1410 

Asp Gin Leu He Glu Leu He Ala His Glu His Ala Pro Leu Ala 
1415 1420 1425 

Val Ala Gin Gin Ala Ala Asn Leu Phe Gly Arg Pro Leu Phe Thr 
1430 1435 1440 

Ser He Phe Asn Tyr Arg Tyr Ala Arg Gly Ala Glu Pro Ala Gly 
1445 1450 1455 

Ala Ala Leu Asp Gly He Arg Leu Leu Ser Ala Arg Asp Leu Thr 
1460 1465 1470 

Asn Tyr Pro Leu Ala Val Ala Val Asp Ala Glu Gly Asp Thr Phe 
1475 1480 1485 

Ser Leu Thr Val Asp Ala Val Ala Pro Ala Asp Pro Val Gin Val 
1490 1495 1500 

Gly Glu Leu Leu Val Thr Ala Leu Arg Asn Leu Thr Arg Thr Ala 
1505 1510 1515 

Glu Asn Ala Pro Gly Thr Pro Leu Ala Ala Val Gly Val Leu Gly 
1520 1525 1530 

Glu Asp Glu Leu Ser Arg Val Val Ser Gly Trp Asn Asp Thr Ala 
1535 1540 1545 

Arg Arg Val Arg Gin Ala Ser Val Pro Glu Leu Phe Ala Glu Arg 
1550 1555 1560 

Val Ala Ala Ala Pro Gly Ala Pro Ala Val Ala Ala Gly Asp Leu 
1565 1570 1575 

Arg Trp Thr Tyr Ala Asp Leu Asp Ala Arg Ser Asp Ala Leu Ala 
1580 1585 1590 

Arg Ser Leu Val Ala Ala Gly Val Thr Ala Glu Ser Pro Val Val 
1595 1600 1605 



Val Ala Leu Glu Arg Ser Ala Asp Val Leu Thr Ala Phe Leu Ala 
1610 1615 1620 

Val Ala Lys Ala Gly Gly Val Phe Val Pro Val Asp Leu Ser Trp 
1625 1630 1635 

Pro Arg Ala Arg Val Asp Ala Val lie Ala Asp Cys Ala Ala Trp 
1640 1645 1650 

He Ala Val Ala Asp Arg Pro Met Thr Gly Leu Thr Val Val Pro 
1655 1660 1665 

Ala Asn Arg Ala Gly Asp Pro Ala Val Ala Leu Pro Pro Arg Pro 
1670 1675 1680 

Leu Pro Gly Ala Ala Ala Tyr Arg Met Tyr Thr Ser Gly Ser Thr 
1685 1690 1695 

Gly Arg Pro Lys Gly Val Val Thr Thr His Gin Asn Val Val Asp 
1700 1705 1710 

Leu Val Thr Asp Arg Cys Trp Gly Pro Thr Pro Arg Val Leu Phe 
1715 1720 1725 

His Ala Pro His Ala Phe Asp Ala Ser Ser Phe Glu Leu Trp Val 
1730 1735 • 1740 

Pro Leu Leu Thr Gly Gly Thr Val Val Val Ala Pro Gly Glu Ser 
1745 1750 1755 

He Asp Thr Gly Val Leu Arg Gin Leu He Arg Ala His Glu Leu 
1760 1765 1770 

Thr His Val His Val Thr Ala Gly Leu Leu Arg Val Leu Ala Glu 
1775 1780 1785 

Asp Pro Ser Cys Phe Ala Gly Leu Thr Glu Val Leu Thr Gly Gly 
1790 1795 1800 

Asp Val Val Pro Ala Glu Ala Val Arg Arg Val Leu Asp Ala Asn 
1805 1810 1815 

Pro Gly Val Arg Val Arg Gin Leu Tyr Gly Pro Thr Glu Val Thr 
1820 1825 1830 

Leu Cys Ala Thr Gin His Val Val Arg Glu Pro Ser Pro Val Leu 
1835 1840 1845 

Pro He Gly Arg Pro Leu Asp Asn Thr Arg Val Tyr Val Leu Asp 
1850 1855 I860 

Gly Leu Leu Gin Pro Val Pro Val Gly Val Thr Gly Glu Leu Tyr 
1865 1870 1875 

He Ala Gly Ala Gly Val Ala Arg Gly Tyr Ala Asp Met Pro Gly 
1880 1885 1890 

Thr Thr Ala Glu Arg Phe Val Ala Asp Pro Phe Thr Ala Gly Gly 
1895 1900 1905 



Arg Leu Tyr Arg Thr Gly Asp Leu Val Arg Trp Thr Gly Glu Gly 
1910 1915 1920 

Glu Leu Val Phe Ala Gly Arg Ala Asp Asp Gin Val Lys lie Arg 
1925 1930 1935 

Gly Tyr Arg Val Glu Pro Gly Glu Val Glu Ala Val Leu Ala Ala 
1940 1945 1950 

Leu Pro Gly Val Ser Gin Ala Ala Val lie Val Arg Glu Asp Val 
1955 1960 1965 

Pro Gly Asp Lys Arg Leu Val Ala Tyr Leu Val Ala Ala Pro Glu 
1970 1975 1980 

Thr Val Glu Ala Ala Arg Ala His Ala Glu Gin Arg Leu Pro Ser 
1985 1990 1995 

Tyr Leu Val Pro Ser Ala Phe Val Gin Leu Asp Ala Leu Pro Leu 
2000 2005 2010 

Thr Gly Asn Gin Lys Val Asp Arg Ala Ala Leu Pro Ala Pro Leu 
2015 2020 2025 

Gly Phe Glu Ala Gly Ala Gly Arg Ala Pro Ala Asp Ala Arg Glu 
2030 2035 2040 

Glu Leu Val Gly Ala Ala Phe Ala Glu Val Leu Asp Leu Gly Arg 
2045 2050 2055 

Val Gly Pro Asp Asp Asp Phe Phe Ala Leu Gly Gly His Ser Leu 
2060 2065 2070 

Leu Ala Leu Ala Leu Val Glu Arg Leu Arg Arg Gin Gly Leu Gly 
2075 2080 2085 

Val Ser Val Arg Ala Val Phe Asp Ala Arg Thr Pro Ala Ala Leu 
2090 2095 2100 

Thr Arg Arg Gly Asp Gly Gly Ala Asp Asp Arg Pro Ala Leu Arg 
2105 2110 2115 

Ala Gly Ala Arg Pro Ala Arg Leu Pro Leu Ser Tyr Ala Gin Arg 
2120 2125 2130 

Arg Leu Trp Phe Leu Ala Gin Leu Glu Gly Pro Ser Ala Thr Tyr 
2135 * 2140 2145 

Asn lie Pro Val Ala Leu Arg Leu Glu Gly Asp Leu Asp Arg Asp 
2150 2155 . 2160 

Ala Leu Thr Ala Ala Leu Arg Asp Val Val Ala Arg His Glu Val 
2165 2170 2175 

Leu Arg Thr Val Phe Thr Val Ala Asp Gly Glu Pro Trp Gin His 
2180 2185 2190 

lie Leu Asp Pro Ala Arg Ala Glu Pro Ala Leu Pro Val Val Asp 
2195 2200 2205 

Val Pro Ala Gly Arg Val Glu Glu Ala Val Ala Glu Ala Ala Ala 



2210 2215 2220 

Tyr Ala Phe Asp Leu Ala Arg Glu lie Pro Leu Arg Ala Val Leu 
2225 2230 2235 

Leu Ala Pro Gly Asp Gly Thr His Val Leu Val Leu Val Leu His 
2240 2245 2250 

His He Ala Ala Asp Gly Trp Ser Met Arg Pro Leu Ala Arg Asp 
2255 2260 2265 

Leu Ala Thr Ala Tyr Ala Ala Arg Arg Arg Gly Gin Ala Pro Glu 
2270 2275 2280 

Ser Glu Thr Leu Pro Val Gin Tyr Ala Asp Tyr Ala Leu Trp Gin 
2285 2290 2295 

Arg Asp Leu Leu Gly Ser Asp Ser Asp Pro Ala Ser Leu He Ser 
2300 2305 2310 

Arg Gin He Ala His Trp Arg Glu Arg Leu Asp Gly Val Pro Glu 
2315 2320 2325 

Glu Leu Asp Leu Pro Ala Asp Arg Pro Arg Pro Ala Ala Ala Ser 
2330 2335 2340 

His Arg Gly His Leu His Ser Ala Glu He Pro Ala Asp Val His 
2345 2350 2355 

Arg Ser Leu Arg Arg Val Ala Ala Asp His Gly Ala Thr Val Phe 
2360 2365 2370 

Met Thr Leu Gin Ala Ala Val Ala Val Leu Leu Ser Arg Leu Gly 
2375 2380 2385 

Ala Gly Thr Asp Val Pro He Gly Thr Val Val Ala Gly Arg Ala 
2390 2395 2400 

Asp Arg Ala Leu Glu Asn Leu Val Gly Phe Phe Val Asn Thr Leu 
2405 2410 2415 

Val Leu Arg Thr Asp Leu Thr Gly Asp Pro Arg Leu Thr Asp Val 
2420 2425 2430 

Leu Gly Gin Val Arg Glu Leu Thr Leu Arg Ala Leu Ala His Gin 
2435 2440 2445 

Asp Val Pro Phe Glu Lys Leu Val Glu Glu Leu Thr Pro Ala Arg 
2450 2455 2460 

Ser Leu Ala Arg His Pro Leu Phe Gin Val Met Val Thr Leu Asp 
2465 2470 2475 

Gly Gly Gly Pro Asp Gly Ala Glu Leu Pro Gly Leu Ala Met Ser 
2480 2485 2490 

Val Val Pro Thr Gly Ala Val Pro Ala Lys Phe Asp Leu Asp Leu 
2495 2500 , 2505 

Thr Phe Thr Glu Thr Phe Asp Ala Ala Gly Glu Pro Ala Gly Leu 
2510 2515 2520 



Arg Val Asp Leu lie Ala Ala Ala Asp Leu Phe Asp Ala Gly Thr 
2525 2530 2535 



Ala Ala Arg Leu Ala Gly Tyr Leu Ser Arg Val Leu Gly Val Leu 
2540 2545 2550 

Ala Ala Asp Pro Arg Arg Arg Leu Ala Glu Val Asp Pro Leu Glu 
2555 2560 2565 

Ala Glu Glu Ser Arg Leu Met Leu Ala Ala Gly Glu Glu Pro Ala 
2570 2575 2580 

Pro Ala Leu Pro Glu lie Thr Val Ala Ala Leu Val Ala Glu Gin 
2585 2590 2595 

Cys Ala Arg Thr Pro Gly Ala Val Ala Val Thr Gly Pro Asp Ala 
2600 2605 2610 

Ser Leu Thr Tyr Ala Glu Leu Asp Glu Arg Ala Ala Arg lie Ala 
2615 2620 2625 

Arg Trp Leu Arg Arg His Gly Ala Gly Pro Gly Ala Ala Val Cys 
2630 2635 2640 

Val Leu Met Glu Arg Ser Ala Glu Leu Val Ala Val Leu Leu Gly 
2645 2650 2655 

Val Met Arg Ala Gly Ala Ala Tyr Val Pro Val Asp Pro Ala Tyr 
2660 2665 2670 

Pro Ala Glu Arg lie Arg Phe Val Val Thr Asp Ala Arg Ala Ala 
2675 2680 2685 

Cys Val Val Ser Glu Ser Ala Ser Ala Gly Leu Val Pro Asp Gly 
2690 2695 2700 

Val Pro Cys Leu Ala lie Asp Asp Pro Ala Ala Ala Ala Glu Pro 
2705 2710 2715 

Ala Glu Pro Gly Asp Asp Pro Gly Asp Ala Ala Gly Pro Arg Pro 
2720 2725 2730 

Asp Asp Pro Ala Tyr lie lie Tyr Thr Ser Gly Ser Thr Gly Thr 
2735 2740 2745 

Pro Lys Gly Val Val Val Ser His Arg Asn Val Val Ala Leu Leu 
2750 2755 2760 

Thr Ala Thr Arg Pro Leu Phe Gly Phe Ala Gly Asp Glu Val Trp 
2765 2770 2775 

Ser Trp Phe His Ser Val Ala Phe Asp Phe Ser Val Trp Glu Leu 
2780 2785 2790 

Trp Gly Ala Leu Thr His Gly Gly Arg Val Val Val Val Pro Tyr 
2795 2800 2805 

Ala Val Ser Arg Ser Pro Arg Asp Phe Trp Glu Leu Val Val Arg 
2810 2815 2820 



95 



Glu Gly Val Thr Val Leu Ser Gin Thr Pro Ser Ala Phe Ala Gin 
2825 2830 2835 



Leu Met Ala Ala Ala Gly Asp Asp Asp Arg Asp Ala Leu Arg Phe 
2840 2845 2850 

Val Val Phe Gly Gly Glu Ala Leu Asp Pro Gly Arg Leu Ala Gly 
2855 2860 2855 

Trp Leu Ala Arg Arg Pro Asp Lys Pro Arg Leu Val Asn Met Tyr 
2870 2875 2880 

Gly lie Thr Glu Thr Thr Val His Thr Thr Tyr Gin His lie Ala 
2885 2890 2895 

Pro Gly Thr Thr Gly Ser Val lie Gly Arg Gly Leu Pro Gly Phe 
2900 2905 2910 

Gly Leu Tyr Val Leu Asp Glu Ala Leu Arg Pro Val Pro Ala Gly 
2915 2920 2925 

Val Pro Gly Glu Val Tyr Ala Arg Gly Pro Gin Val Ala Arg Gly 
2930 2935 2940 

Tyr lie Gly Arg Pro Gly Leu Thr Ala Glu Arg Phe Val Ala Ser 
2945 2950 2955 

Pro Phe Ala Pro Gly Glu Arg Met Tyr Arg Thr Gly Asp Val Ala 
2960 2965 2970 

Arg Trp Thr Ala Asp Gly Arg Leu Val Phe Ala Gly Arg Ser Asp 
2975 2980 2985 

Asp Gin lie Lys lie Arg Gly Phe Arg lie Glu Pro Gly Glu Val 
2990 2995 3000 

Glu Ala Val Leu Ala Ala Gly Pro Gly Val Ser Gin Ala Ala Val 
3005 3010 3015 

lie Val Arg Glu Asp Val Pro Gly Asp Lys Arg Leu Val Ala Tyr 
3020 3025 3030 

Val Val Gly Gly Asp Ala Glu Thr Leu Arg Ser His Ala Gin Gin 
3035 3040 3045 

Arg Leu Pro Gly Tyr Leu Val Pro Ser Ala Phe Val Glu Leu Asp 
3050 3055 3060 

Arg Leu Pro Leu Thr Val Asn Gly Lys Leu Asp Arg Arg Ala Leu 
3065 3070 3075 

Pro Val Pro Asp Tyr Gly Arg Asp Ala Gly Gly Gly Arg Ala Pro 
3080 3085 . 3090 

Ala Asn Ala Arg Glu Glu Val Leu Cys Arg Ala Phe Ala Glu Val 
3095 3100 3105 

Leu Gly Val Glu Arg Val Gly Val Glu Asp Asp Phe Phe Ala Leu 
3110 3115 3120 

Gly Gly His Ser Leu Leu Val Val Ser Leu Val Glu Arg Leu Arg 
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3125 3130 3135 

Arg Gin Gly He Ser Val Pro Val Arg Ala Leu Phe Thr Thr Pro 
3140 3145 3150 

Thr Pro Ala Gly Leu Ala Glu Ala Val Gly Asp Gly Ala Val Val 
3155 3160 3165 

Val Pro Pro Asn Leu He Pro Glu Asp Ala Ala Glu Leu Thr Pro 
3170 3175 3180 

Glu Met Leu Pro Leu Ala Asp Leu Thr Ala Asp Glu Leu Ala Val 
3185 3190 3195 

Val Val Ala Ser Val Pro Gly Gly Ala Ala Asn He Ala Asp Val 
3200 3205 3210 

Tyr Pro Leu Ala Pro Leu Gin Glu Gly He Phe Phe *His His Met 
3215 3220 3225 

Met Ala Asp Arg Asp Ser Ala Asp Val Tyr Val Thr Pro Thr Val 
3230 3235 3240 

Val Glu Phe Asp Ser Arg Asp Arg Leu Asp Gly Phe Leu Ala Ala 
3245 3250 3255 

Leu Gin Gin Val Val Asp Arg Thr Asp Val Tyr Arg Thr Ser Val 
3260 3265 3270 

Val Trp Gin Gly Leu Arg Glu Pro Val Gin Val Val Trp Arg His 
3275 3280 3285 

Ala Arg Leu Pro He Asp Glu Val Glu Leu His Glu Gly Thr Asp 
3290 3295 3300 

Pro Ala Glu Gin Leu He Ala Leu Ala Thr Glu Arg Val Asp Leu 
3305 3310 3315 

Asp Arg Ala Pro Leu He Arg Thr Thr Thr Ala Ala Val Pro Gly 
3320 3325 3330 

Ser Gly Arg Trp Leu Ala Leu Leu Arg He His His Leu Val Gin 
3335 3340 3345 

Asp His Thr Thr Leu Asp Val Leu Leu Gly Glu Leu Arg Ala Phe 
3350 3355 3360 

Leu Glu Gly Arg Gly Asp Glu Leu Pro Glu Pro Val Pro Phe Arg 
3365 3370 3375 

Glu Phe Val Ala Gin Ala Arg Leu Gly Val Pro Arg Glu Glu His 
3380 3385 3390 

Glu Arg Tyr Phe Ala Glu Leu Leu Gly Asp Val Thr Glu Thr Thr 
3395 3400 3405 

Ala Pro Tyr Gly Leu Thr Glu Val His Gly Asp Gly Ser Ala Ala 
3410 3415 3420 

Val His Ser Arg Arg Glu Val Asp Asp Asp Leu Ala Ala Arg Leu 
3425 3430 3435 



His Arg Leu Ala Arg Ser Leu Gly Val Ser Pro Ala Ala Leu Phe 
3440 3445 3450 

His Leu Ala Trp Ala Arg Val Leu Gly Ala Val Ser Gly Arg Asp 
3455 3460 3465 

Asp Val Val Phe Gly Thr Val Leu Phe Gly Arg Met Asn Ser Gly 
3470 3475 3480 

Ala Ala Ala Asp Arg Val Gin Gly Leu Phe lie Asn Thr Leu Pro 
3485 3490 3495 

Val Arg Val Arg Leu Ala Ala Gly Ser Thr Arg Asp Ala Leu Thr 
3500 3505 3510 

Gly Leu Arg Asp Gin Leu Ala Gly Leu Leu Val His Glu His Ala 
3515 3520 3525 

Pro Leu Ala Leu Ala Gin Arg Ala Ala Gly lie Thr Asp Gly Ser 
3530 3535 3540 

Pro Leu Phe Ala Ser lie Phe Asn Tyr Arg His Asn Gin Asp Asp 
3545 3550 3555 

Pro Ala Ala Ser Ala Gly Leu Glu Gly lie Arg Thr Val Tyr Ser 
3560 3565 3570 

Ala Glu His Thr Asn Tyr Pro Leu Asp Ala Ser lie Asp Val Thr 
3575 3580 3585 

Gly Asp Arg Phe Ala lie Thr Val Asn Ala Val Ala Ala Asp Ala 
3590 3595 3600 

Ala Arg lie Ala Glu Leu Met His Thr Cys Leu Gly His Leu Ala 
3605 3610 3615 

Asp Val Leu Glu Asp Ala Pro Glu Thr Pro Leu Ser Trp Val Ser 
3620 3625 3630 

Pro Leu Ser Ala Glu Asp Leu Gly Arg lie Val Gly Asp Trp Asn 
3635 3640 3645 

Glu Thr Arg Arg Ala Val Thr Arg Ala Ser Val Pro Glu Leu Phe 
3650 3655 3660 

Ala Lys Gin Val Ala Ala Thr Pro Asp Ala lie Ala Val Ala Gly 
3665 3670 3675 

Glu Gly Val Ser Trp Ser Tyr Arg Glu Leu Asp Val Arg Ser Asp 
3680 3685 3690 

Ala Leu Ala Arg Ser Leu Val Ala Ala Gly Val Gly lie Glu Ser 
3695 3700 3705 

Pro Val Val Val Ala Leu Asp Arg Ser Pro Glu Val Pro Thr Ala 
3710 3715 3720 

Phe Leu Ala Val Ala Lys Ala Gly Gly Val Phe Val Pro Val Asp 
3725 3730 3735 



Leu Ser Trp Pro Gin Ala Arg Val Asp Ala Val lie Ala Asp Cys 
3740 3745 3750 



Ala Ala Arg Val Ala Val Ala Asp Arg Pro Met Thr Gly Leu Thr 
3755 3760 3765 

Val Val Pro Ala Asp Ala Ala Gly Asp Pro Ala Ala Glu Leu Pro 
3770 3775 3780 

Pro Arg Pro Leu Pro Gly Ala Glu Val Tyr Arg Met Tyr Thr Ser 
3785 3790 3795 

Gly Ser Thr Gly Arg Pro Lys Gly Val Val Thr Thr His Gin Asn 
3800 3805 3810 

Leu Val Asp Leu Ala Thr Asp Thr Cys Trp Gly Pro Thr Pro Arg 
3815 3820 3825 

Val Leu Phe His Ala Pro His Ala Phe Asp Ala Ser Ser Tyr Glu 
3830 3835 3840 

lie Trp Val Pro Leu Leu Asn Gly Gly Thr Val Val Val Ala Pro 
3845 3850 3855 

Gly Arg Ser lie Asp Ala Ala Val Leu Gly Glu Leu lie Arg Ala 
3860 3865 3870 

His Glu Leu Thr His Val His Val Thr Ala Gly Leu Leu Arg Val 
3875 3880 3885 

Leu Asp Pro Ser Cys Phe Ala Gly Leu Thr Glu Val Leu Thr Gly 
3890 3895 3900 

Gly Asp Ala Val Ser Ala Glu Ala Val Arg Arg Val Met Glu Ala 
3905 ' 3910 3915 

Asn Pro Gly Leu Arg Val Arg Gin Leu Tyr Gly Pro Thr Glu Val 
3920 3925 3930 

Thr Leu Cys Ala Thr Gin Gin Val Leu Asp Gly Thr Gly Val Pro 
3935 3940 3945 

lie Gly Arg Pro Leu Asp Asn Thr Arg Val Tyr Val Leu Asp Asp 
3950 3955 3960 

Leu Leu Gin Pro Val Pro Val Gly Val Thr Gly Glu Leu Tyr Val 
3965 3970 3975 

Ala Gly Ala Gly Leu Ala Arg Gly Tyr Ala Gly Met Pro Gly Leu 
3980 3985 3990 

Thr Ala Glu Arg Phe Val Ala Asp Pro Phe Ser Ser Gly Gly Arg 
3995 4000 4005 

Leu Tyr Arg Thr Gly Asp Leu Val Arg Trp Thr Asp Asp Gly Val 
4010 4015 4020 

Leu Val Phe Ala Gly Arg Ala Asp Asp Gin Val Lys lie Arg Gly 
4025 4030 4035 



Tyr Arg Val Glu Pro Gly Glu Val Glu Ala Val Leu Ala Ala His 



4040 



4045 



4050 



Pro Asp Val Ala Gin Val Ala Val Val Val Arg Glu Asp Thr Pro 
4055 4060 4065 

Gly Asp Lys Arg Leu Val Ala Tyr Val Val Gly Gly Asp Val Glu 
4070 4075 4080 

Ala Tyr Ala Gin Glu Arg Leu Pro Gly Tyr Leu Val Pro Ser Ala 
4085 4090 4095 

Phe Val His Leu Asp Ala Leu Pro Leu Thr Ser Asn Gin Lys Val 
4100 4105 4110 

Asp Arg Ala Ala Leu Pro Ala Pro Ser Val Glu Ser Gly Ala Gly 
4115 4120 4125 

Arg Ala Pro Ala Asp Ala Arg Glu Glu Leu Met Cys Ala Ala Phe 
4130 4135 4140 

Ala Glu Val Leu Asp Leu Asp Arg Val Gly Val Asp Asp Asp Phe 
4145 4150 4155 

Phe Ala Leu Gly Gly His Ser Leu Leu Val Val Arg Leu Val Gly 
4160 4165 4170 

Arg lie Arg Gin Val Phe Gly Val Glu Val Ser Ala Arg Leu Val 
4175 4180 4185 

Phe Asp Ala Arg Thr Pro Ala Gly Val Val Ala Arg Leu Ser Glu 
4190 4195 4200 

Gly Gly Thr Ala Arg Glu Ala Val Arg Ala Arg Val Arg Pro Ala 
4205 4210 4215 

Arg Val Pro Leu Ser Phe Ala Gin Arg Arg Leu Trp Phe Leu Ser 
4220 4225 4230 

Gin Leu Asp Gly Thr Ser Thr Thr Tyr Asn lie Pro Val Ala Leu 
4235 4240 4245 

Gin Leu Asp Gly Pro Leu Asp Arg Asp Ala Phe Thr Ala Ala Leu 
4250 4255 4260 

His Asp Val Val Ala Arg His Glu Val Leu Arg Thr Val Phe Thr 
4265 4270 4275 

Val Ala Asp Gly Glu Pro Trp Gin His lie Leu Asp Thr Pro Ser 
4280 4285 4290 

Val Ser Val Pro Val lie Glu Val Pro Ala Asp Gly Leu Pro Glu 
4295 4300 4305 

Ala Val Ala Ala Ala Ala Ala His Thr Phe Asp Leu Ser Arg Glu 
4310 4315 4320 

He Pro Leu Arg Ala Val Leu Leu Ala Thr Gly Ala Asp Arg His 
4325 4330 4335 



Val Leu Val Leu Val Val His His He Ala Ala Asp Gly Trp Ser 
4340 4345 4350 



Met Gin Pro Leu Ala Arg Asp Leu Ala Val Ala Tyr Ala Ala Arg 
4355 4360 4365 



lie Arg Gly Glu Ala Pro Ala Trp Thr Ala Leu Pro Val Gin Tyr 
4370 4375 4380 

Ala Asp Tyr Ala Leu Trp Gin Arg Asp Val Leu Gly Ser Glu His 
4385 4390 4395 

Asp Pro Asp Ser Ala lie Ser Gin Gin Val Ala His Trp Arg Arg 
4400 4405 4410 

Gin Leu Ala Gly Ala Pro Asp Glu Leu Pro Leu Pro Ala Asp His 
4415 4420 4425 

Pro Arg Pro Ala Glu Ala Thr Tyr Arg Gly His Thr Val Glu Phe 
4430 4435 4440 

Thr Val Pro Pro Ala Val His His Gin Leu Ala Glu Leu Ala Arg 
4445 4450 4455 

Arg Asn Gly Val Thr Val Phe Met Thr Val Gin Thr Ala Leu Ala 
4460 4465 4470 

Val Leu Leu Ser Lys Leu Gly Ala Gly Thr Asp He Pro lie Gly 
4475 4480 4485 

Val Ala Val Ala Gly Arg Thr Asp Pro Thr Leu Asp Asn Leu He 
4490 4495 4500 

Gly Phe Phe Val Asn Thr Leu Val Leu Arg Thr Asp Leu Thr Gly 
4505 4510 4515 

Asn Pro Thr He Thr Asp Leu Leu His Arg Thr Arg Asp Thr Thr 
4520 4525 4530 

Leu His Ala Phe Thr His Gin Asp Val Pro Phe Glu Lys Leu Val 
4535 4540 4545 

Glu Asp Leu Ala Pro Thr Arg Ser Leu Ala Arg His Pro Leu Phe 
4550 4555 4560 

Gin Val Met Met Thr Leu Gin Ser Thr Gly Arg Ala Gly Glu Ala 
4565 4570 4575 

Ala Glu Leu Pro Gly Leu Glu Thr Ala Val Leu Ser Pro Gly Gly 
4580 4585 4590 

Val Ala Ala Lys Val Asp Leu Asp Leu Ser Leu Ser Glu Ala Tyr 
4595 4600 4605 

Asp Asp Asp Gly Arg Pro Ala Gly Leu Ala Gly Thr Leu Val Ala 
4610 4615 4620 

Ala Ala Asp Leu Phe Glu His Gly Thr Ala Glu Arg He Ala Gly 
4625 4630 4635 

Tyr Leu Ala Arg Leu Leu Ala Val Leu Pro Ala Asp Pro Gly Ala 
4640 4645 4650 



101 



Arg Leu Gly Asp Val Asp Leu Leu Asp Gly Glu Glu Arg Arg Leu 
4655 4660 4665 



Val Leu Thr Gly Trp Asn Asp Thr Thr Ala Ala Val Pro Ala Val 
4670 4675 4680 

Ala Val Pro Glu Leu lie Glu Arg Arg Ala Ala Ala Glu Pro Glu 
4685 4690 4695 

Ala Gly Ala Val Trp Cys Gly Asp Thr His Leu Arg Tyr Gly Glu 
4700 4705 4710 

Leu Asn Ala Arg Ala Asn Arg Leu Ala Arg Leu Leu Val Glu Arg 
4715 4720 4725 

Gly Ala Gly Pro Glu Ser lie Val Ala Val Cys Leu Glu Arg Ser 
4730 4735 4740 

Ala Asp Leu Val Val Thr Leu Leu Ala Val Leu Lys Thr Gly Ala 
4745 4750 4755 

Ala Tyr Leu Pro lie Asp Pro Gly Tyr Pro Ala Gly Arg lie Ala 
4760 4765 4770 

Tyr Met Leu Ala Asp Ala Arg Pro Ala Leu Leu Val Thr Ser Pro 
4775 4780 4785 

Ala Val Ala Ser Gly Asp Ser Leu Pro Asp Gly Gly Ala Gin Arg 
4790 4795 4800 

lie Val Leu Gly Asp Pro Asp Thr Ala Ala Ala Leu Asp Gly Leu 
4805 4810 4815 

Ala Gly Thr Asp Leu Leu Val Ser Glu Arg Arg Gly Val Thr His 
4820 4825 4830 

Pro Ala His Pro Ala Tyr Val lie Tyr Thr Ser Gly Ser Thr Gly 
4835 4840 4845 

Arg Pro Lys Gly Val Val Val Pro His Gly Ala Leu Thr Asn Phe 
4850 4855 4860 

Val Ala Ala Met Ser Asp Arg Leu Ala Leu Gly Ala Gly Asp Arg 
4865 4870 4875 

Leu Leu Ala Val Thr Thr Val Ala Phe Asp lie His Val Leu Glu 
4880 4885 4890 

Leu Tyr Val Pro Leu Val Gly Gly Ala Gly Val Val Val Ala Glu 
4895 4900 4905 

Asp Ala Val Val Arg Asp Pro Ala Ala Val Ala Ala Leu Leu Asp 
4910 4915 4920 

Arg His Ala Val Thr lie Val Gin Ala Thr Pro Ala Leu Trp Gin 
4925 4930 4935 

Ala Leu Leu Ala Gly His Ala Asp Ala Val Arg Asp Val Arg Leu 
4940 4945 4950 



Leu Val Gly Gly Glu Ala Leu Pro Pro Ala Leu Ala Gly Arg Met 



4955 



4960 



4965 



Ala Ala Ala Gly Arg Gly Val Thr Asn Leu Tyr Gly Pro Thr Glu 
4970 4975 4980 

Val Thr Val Trp Ala Thr Val Ala Asp Leu Gly Ala Ser Pro Ala 
4985 4990 4995 

Gly 



<210> 16 
<211> 234 
<212> PRT 

<213> Actinoplanes sp, 
<400> 16 

Met Gin Lys lie Pro Leu Val Cys Val Pro Phe Ala Gly Ala Gly Ala 
15 10 15 

Ser Phe Phe His Pro Trp Ala Glu Leu Ala Gly Pro Asp Arg Pro lie 
20 25 30 

Val Ala Leu Gin Leu Pro Gly Arg Glu Trp Arg Leu Leu Asp Glu Pro 
35 40 45 

Tyr Ala Asp Val Val Ala Ala Ala Ala Asp Leu Ala Leu Thr Val Ala 
50 55 60 

Asp Glu Val Gly Ala Gly Gly Arg Val Ala Leu Phe Gly His Ser Leu 
65 70 75 80 

Gly Ala Val Leu Ala Tyr Glu lie Ala His Ala Leu Val Arg Asp Gly 
85 90 95 

Glu Val Gly Val Glu Arg Leu Phe Val Ser Gly Ser Pro Asp Pro Trp 
100 105 110 

Thr Pro Arg Thr Asn Arg Ala Ser Gly Leu Asp Asp Glu Glu Phe Leu 
115 120 125 

Leu Arg Val Arg Glu Phe Ala Gly Tyr Asp His Glu Ala Leu Ala Asp 
130 135 140 

Pro Asp Met Arg Glu Leu lie Leu Pro Ala Leu Arg Ala Asp Val Glu 
145 150 155 160 

Met His Glu Ser Tyr Val Ala Gly Ser Ala Asp Pro Leu Pro Ala Pro 
165 170 175 

Val Thr Ala Leu His Ala Arg Asp Asp Ala Leu Val Ser Ala Glu Gin 
180 185 190 

Thr Ala Gly Trp Ser Lys Ala Thr Ser Gly Pro Phe Gin Leu Val Glu 
195 200 205 

Val Asp Gly Gly His Met Tyr Leu Thr Glu Asp Pro Ala Gly Leu Leu 
210 ' 215 220 



Arg Leu lie Ala Ala Asp Leu Asp Arg Asp 



225 



230 



<210> 17 
<211> 274 
<212> PRT 

<213> Actinoplanes sp . 
<220> 

<221> misc_feature 
<222> (1)..(1) 

<223> V represents a non-standard 
the biosynthesized protein 
t this position 



initiator codon. It is expected that 
will have a f ormylmethionine reidue a 



<400> 17 

Val Arg Leu Thr Gly Lys Thr Ala He Val Thr Gly Ala Ala Arg Gly 
15 10 15 

Leu Gly Arg Ala Cys Ala Val Ala Phe Ala Ala Glu Gly Ala Asp Leu 
20 25 30 

Val Leu Leu Asp Arg Ala Ala Asp Leu Pro Gly Val Pro Tyr Pro Leu 
35 40 45 

Gly Thr Val Gly Gin Leu Glu His Thr Ala Asp Leu Cys Arg Lys Gin 
50 55 60 

Gly Ala Ala Val Leu Thr Val Arg Ala Asp Val Arg Asp Leu Ala Ala 
65 70 75 80 

Leu Thr Ala Ala Ala Asp Arg Ala He Asp Arg Phe Gly Gly He Asp 
85 90 95 

val Leu val Asn Asn Ala Gly He Ala Ala Pro Ser Gly Lys Val Thr 
100 105 110 

His Glu He Thr Glu Asp Glu Trp Gin Leu Met He Asp Val Asp Leu 
115 120 125 

Ser Gly Ala Trp Arg Met Thr Ala Ala Val Gly Arg His Met Thr Glu 
130 135 140 

Arg Arg Ser Gly Ser He Val Asn He Ala Ser Thr Ala Gly >Gln Val 
145 150 155 160 

Gly Tyr Arg His Phe Ala Gly Tyr Val Ala Ala Lys His Gly He Val 
165 170 175 

Gly Leu Thr Arg Ala Ala Ala Leu Asp Tyr Ala Pro Ala Lys Val Arg 
180 185 190 

Val Asn Ala Val Cys Pro Gly Ser Val Arg Asp Asp Pro Gin Phe Glu 
195 200 205 

Gly Arg Met Leu Ser Glu He Ala Arg Ser Leu Asp Val Pro Val Ala 
210 215 220 

Glu His Glu Gin Thr Phe Leu Gin Ala Gin Pro Met Asn Ala Leu He 
225 230 235 240 



Glu Pro Asp Asp Val Ala Asn Ala Ala lie Trp Leu Ala Ser Asp Glu 
245 250 255 



Ser Arg Gin Val Thr Gly Ser Val Val Thr Val Asp Gly Gly Phe Thr 
260 265 270 

Thr Arg 



<210> 18 
<211> 891 
<212> PRT 

<213> Actinoplanes sp, 
<220> 

<221> misc_feature 
<222> (1) . . (1) 

<223> V is a non-standard initiator codon. It is expected that the bio 
synthesized protein will have a f ormylmethionine residue at this 
position 



<400> 18 

Val Pro Lys Ser Gin Pro Ala Thr Arg Thr Ala Ala Pro Gly Ala Ala 
15 10 15 

Glu Cys His Ala Leu Ala Val Arg Leu Ala Gly Pro lie Asp Pro Ala 
20 25 30 

Pro lie Glu Arg Arg Leu Ala Ala Arg Met Pro Phe Trp His Glu His 
35 40 45 

Val Ala Ala Arg Pro Gly Asp Glu Ala Ala Leu Arg Arg Arg Glu Arg 
50 55 60 

Glu Leu Ala Arg Pro Val Pro Pro Glu Pro Gly Ala Arg Ala Val Leu 
65 70 75 80 

Leu Ala Tyr Ala Asp Gly Ser Ala Asp Leu Val Leu Val Ala Arg Arg 
85 90 95 

Asp Arg Leu Asp Arg Asp Ala Leu lie Ala Leu Ala Arg Pro Glu Arg 
100 105 110 

Ala Pro Arg Gly Arg Lys Pro Ala Glu Pro Asp Ala Pro Pro Pro Ser 
115 120 125 

Ala Ala Pro Ala Trp Gly Leu Gly Asp Gly Gly Pro Asp Asp Arg Trp 
130 135 140 

Ala Glu Leu Arg Val Pro Ala Arg Gly Pro Ala Asp Pro Ala Arg Trp 
145 150 155 160 

Pro Ala Ala Leu Ala Lys Val Leu Ala Arg Tyr Glu Pro Gly Ala Ala 
165 170 175 

Ala Gly Ser Gly Ala Ala Ala Gly Leu Gly Ala Ala Ala Gly Ser Gly 
180 185 190 



105 



Val Ala Ala Gly 
195 

Pro Gly Pro Val 
210 

Glu Tyr Val Pro 
225 

Ser Arg Thr Pro 



Gly Ala Val Ser 
260 

Ala Ala His Gly 
275 

Thr Pro Pro Ala 
290 

Pro Ala Ala Ala 
305 

Ala Arg Pro Thr 



Tyr Arg Glu Leu 
340 

Lys Ala Gly Val 
355 

Ser Ala Glu Leu 
370 

Ala Tyr Val Pro 
385 

Thr Val Gly Asp 



Phe Pro Asp Val 
420 

Glu Ala Gly Asp 
435 

Glu Arg Pro Ala 
450 

Lys Gly Val Val 
465 

Thr Arg Glu Glu 



.His Ser Ala Ala 
500 



Ser Ser Ala Ala 
200 

Ala Leu Ala Phe 
215 

Phe Leu Ala Pro 
230 

Gly Gly Gly Thr 
245 

Pro Ala Ala Ala 



Glu Pro Pro Ala 
280 

Ala Pro Ala Pro 
295 

Arg Thr Leu Thr 
310 

Ala Val Ala Val 
325 

Asp Glu Trp Ser 



Arg Asp Gly Asp 
360 

Val Ala Val Leu 
375 

Leu Asp Ala Ala 
390 

Ala Gly Leu Ala 
405 

Asp Gly Val Arg 



Asp Pro Gly lie 
440 

Tyr Val lie Tyr 
455 

Val Pro His Ala 
470 

Tyr Ala Leu Gly 
485 

Phe Asp Phe Ser 



Ser Gly Ser Gly 



Asp Gly Asp Leu 
220 

Thr His Pro Leu 
235 

Glu Leu Arg Cys 
250 

Glu Ala Phe Ala 
265 

Asp Asp Gly Ala 



Ala Pro Ala Pro 
300 

Gly Leu Phe Ala 
315 

Ser Asp Asp Arg 
330 

Gly Arg Leu Ala 
345 

Ala Val Gly Val 



Leu Ala Val Leu 
380 

Tyr Pro Ala Asp 
395 ^ 

Val Val Val Thr 
410 

Leu Leu Ala Pro 
425 

Pro Leu Ala Thr 



Thr Ser Gly Ser 
460 

Asn Val Ser Ala 
475 

Pro Gly Asp Val 
490 

Val Trp Glu He 
505 



Ala Ala Ala Val 
205 

Ala Pro Pro Asp 



Thr Val Gin Val 
240 

Arg His Arg Leu 
255 

Arg Met Leu Ala 
270 

Thr Ala Glu Pro 
285 

Ala Pro Ala Pro 



Glu Gin Val Ala 

320 

Gly Arg His Thr 
335 

Arg Gly Leu Arg 
350 

Cys Leu Asp Arg 
365 

Lys Ala Gly Ala 



Arg He Ala Tyr 
400 

Thr Ser Ala Asp 
415 

Glu Ser Leu Ala 
430 

Pro Ala Gly Pro 
445 

Thr Gly Arg Pro 



Leu Leu Asp Ala 
,480 

Trp Thr Phe Phe 
495 

Trp Gly Cys Leu 
510 



Leu Thr Gly Gly His Leu Val Val Val Pro Tyr Trp Val Ser Arg Ser 



515 



520 



525 



Pro Glu Gin Phe His Asp Leu Leu Ala Glu Arg Gly Val Thr Val Leu 
530 535 540 

Asn Gin Thr Pro Ser Ser Phe Thr Gin Leu Val Ala Ala Asp Arg Gly 
545 550 555 560 

Ala Glu Arg Asp Leu Ala Val Arg Leu Val lie Phe Gly Gly Glu Pro 
565 570 575 

Leu Asp Ala Arg Thr Val Leu Pro Trp Leu Asp Arg Arg Pro Glu Ala 
580 585 590 

Arg Cys Arg Leu Val Asn Met Phe Gly lie Thr Glu Thr Thr Val His 
595 600 605 

Val Thr Ala Val Asp Val Thr Arg Ala Ala Ala Leu Ala Gly Ser Arg 
610 615 620 

Ser Val Gly Arg Pro Leu Pro Gly Trp Ala Val Arg Val Leu Asp Glu 
625 630 635 640 

Gin Arg Arg Glu Val Pro Pro Gly Val Pro Gly Glu lie Tyr Val Gly 
645 650 655 

Gly Ala Gly Val Ala lie Gly Tyr Leu Asn Arg Pro Glu Leu Thr Ala 
660 665 670 

Glu Arg Phe Val Thr Gly Pro Asp Gly Arg Arg Trp Tyr Arg Ser Gly 
675 680 685 

Asp Arg Gly Arg Leu Leu Pro Asp Gly Thr Leu Glu His Leu Gly Arg 
690 695 700 

Leu Asp Asp Gin Val Lys Leu Arg Gly Phe Arg lie Glu Leu Asp Glu 
705 710 715 720 

lie Arg Gly Val Leu Thr Glu Cys Ala Gly Val Ala Ala Ala Ala Val 
725 730 735 

Val lie Arg Arg Ser Thr Pro Asp Asp Pro Ala Thr Ala Arg Leu Asp 
740 745 750 

Ala Tyr Val Val Ala Glu Ala Gly Ala Thr Pro Pro Val Ala Glu His 
755 760 765 

Ala Ala Arg Met Leu Pro Ala Tyr Met Cys Pro Ala Thr Phe Thr Phe 
770 775 780 

Leu Asp Ala Leu Pro Met Thr Pro Asn Gly Lys Val Asp Lys Ala Ala 
785 790 795 800 

Leu Pro Glu Pro Ala Arg Pro Ala Ala Asp Ala Ala Ala Thr Pro Ala 
805 810 815 

Gly Pro Gly Glu Asp Gly Leu Ala Gly Asp Leu Ala Asp Val Trp Gin 
820 825 830 



Gin Val Phe Gly Cys Pro Val Thr Val Ser Asp Asn Phe Phe Asp Leu 
835 840 845 



Gly Gly Asn Ser Leu Leu Ala Val Arg Met Ala Ala Leu Met Arg Arg 
850 855 860 

Arg Gly Leu Pro Arg Leu His Pro Arg Thr Leu Tyr Leu His Pro Thr 
865 870 875 880 

Val Arg Gly Leu Ala Asp Ala Leu Arg Ser Ala 
885 890 

<210> 19 
<211> 187 
' <212> PRT 
<213> Actinoplanes sp. 

<400> 19 

Met Arg Asn Leu Arg Arg Thr Thr Gly He Gly Leu Leu Ala Leu Leu 
15 10 15 

Ser Val Ala Ala Cys Ser Ser Thr Pro Ala Ala Ser Glu Pro Pro Pro 
20 25 30 

Ser Ala Ala Pro Pro Ser Ala Val Thr Ala Thr Gly Pro Ala Ala Glu 
35 40 45 

Lys Ala Val Lys Ser Gly Thr Gin Thr Tyr His Gin Ala Leu Asp Ala 
50 55 60 

Phe Val Ala Ala Ser Asn Lys Gly Thr Thr Asp Thr Thr Glu He Gly 
65 70 75 80 

Lys Tyr Ala Ser Gly Arg Ala Leu Met Thr Phe Gin Gly He Leu Ala 
85 90 95 

Ser Tyr Gin Gin Gin Gly Val His Thr Ser Gly Glu Pro Arg He Asp 
100 105 110 

Glu Pro Val Val Thr Gly Leu Thr Pro Pro Ala Asp Pro Thr Gly Val 
115 120 125 

Gin Leu Arg Gly Cys He Asp He Ser Ala Trp Pro Leu Thr Lys Ala 
130 135 140 

Asp Gly Thr Pro Ala Asp Lys Val Gly Gly Gin Gin Gly Ser Gly Pro 
145 150 155 160 

Ser Ala He Leu Ala Asn Val Ala Arg Ser Gly Ala Thr Trp Gin Val 
165 170 175 

Thr Glu Leu Ala He Gin Gly Pro Cys Ala Ala 
180 185 

<210> 20 
<211> 415 
<212> PRT 

<213> Actinoplanes sp. 



<220> 

<221> misc_f eature 
<222> (1)..{1) 



<223> 



V represents a non-standard initiator codon. It is expected that 

the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 20 

Val Thr Val Arg Arg Trp Leu Pro Ala Gly Leu Thr Val Leu Ala Phe 
15 10 15 

Ala Ala Gly Phe Trp Gin Lys Leu Pro Cys Gin Ala Ala Gly Trp Pro 
20 25 30 

Asp Asp Thr Ala Thr Leu Phe Gly Arg Tyr Cys Tyr Ser Asp Val Pro 
35 40 45 

lie Leu Phe Arg Glu Arg Gly Leu Phe Asp Gly lie Phe Pro Tyr Glu 
50 55 60 

Ser Gly Pro Gly Ala Gin Pro Leu Glu Tyr Pro Val Leu Thr Gly Tyr 
65 70 75 80 

Leu Met Asp Ala Thr Ala Arg Leu Val Arg Ala lie Leu Pro Gly Ala 
85 90 95 

Asp Val Ala Val Ala Ser Arg Ala Tyr Phe Leu Thr Thr Val Leu Val 
100 105 110 

Leu Leu Ala Leu Ala Val Leu Thr Val Trp Ala Thr Gly Ala Val Leu 
115 120 125 

Arg Arg Thr Gly Gly Arg Pro Gly Asp Ala Leu Leu Val Ala Ala Ala 
130 135 140 

Pro Val Leu lie Leu Ala Gly Thr Val Asn Trp Asp Leu Leu Ala Val 
145 150 155 160 

Ala Ala Ala Val Leu Ala lie Leu Ala Trp Glu Arg Asp Arg Pro Leu 
165 170 175 

Leu Ala Gly Val Leu lie Gly Leu Gly Thr Ala Ala Lys Leu Phe Pro 
180 185 190 

Leu Val Leu Leu Gly Pro Val Leu Leu Leu Cys Leu Arg Gin Arg Arg 
195 200 205 

Met Arg Arg Phe Ala Arg Val Ala Ala Gly Ala Ala Gly Ala Trp Leu 
210 215 220 

Leu Val Asn Leu Pro Val Val Ala Leu Gin Pro Asp Gly Trp Met Glu 
225 230 235 240 

Phe Trp Arg Phe Asn Ala Gly Arg Gly Ala Glu Phe Gly Ser Leu Trp 
245 250 255 

Phe Ala Leu Asp Gly Leu Gly Leu His Met Pro Ala Val Asn Ala Val 
260 265 270 



Ala Leu Ala Thr Phe Gly Val Leu Leu Ala Gly lie Ala Val Leu Ala 
275 280 285 



Leu Arg Ser Arg Arg Pro Pro Asp Leu Ala Gin Leu Ala Cys Leu Ala 
290 295 300 



Val Gly Ala Phe Leu Leu Thr Asn Lys Val Tyr Ser Pro Gin Tyr Ala 
305 310 315 * 320 

Leu Trp Leu Leu Pro Leu Val Val lie Ala Arg Gly Arg Val Pro Arg 
325 330 335 

Trp Pro Val Val Arg Asp Trp Ala Val Trp Gin Ala Ala Glu Val Leu 
340 345 350 

Tyr Trp Leu Ala Val Trp Ser Trp Leu Ala Gly Ser Leu Thr Asp Glu 
355 360 365 

Arg Gin Tyr Ala Trp Ala Thr Val Leu Arg Val Leu Ala Thr Ala Tyr 
370 375 380 

Val Cys Gly Gin Val Val Trp Asp Val Leu Ala Ala Pro Arg Pro His 
385 390 395 400 

Arg Pro Ala Pro Pro Pro Ala Val Ala Glu Pro Ala His Pro Gly 
405 410 415 

<210> 21 
<211> 491 
<212> PRT 

<213> Actinoplanes sp, 
<220> 

<221> misc__f eature 
<222> (1) . . (1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 21 

Val Ala Ala Gin 
1 

Pro Gly Gly Ser 
20 

Val Leu Leu Leu 
35 

Ser Leu Leu Pro 

50 

Gly Asp Glu lie 
65 

Phe Lys Trp Gly 



Ser Pro Arg Met 
100 

Arg Arg Phe Asp 



Pro Glu Glu Phe 
5 

Thr Ala Ala Ala 



Glu Arg Glu Lys 
40 

Ser Thr Val His 
55 

Ala Lys Ala Gly 
70 

Thr Ser Thr Glu 
85 

Ala Gly Pro Thr 



Gin lie Leu Leu 



Asp Val lie Val 
10 

Leu Thr Ala Lys 
25 

Phe Pro Arg Tyr 



Gly Val Cys Asn 
60 

Phe Met Arg Lys 
75 

Pro Trp Thr Phe 
90 

Ser His Ala Phe 
105 

Glu Asn Ala Arg 



Val Gly Gly Gly 
15 

Gin Gly Ala Lys 
30 

Gin lie Gly Glu 
45 

Leu Leu Gly Val 



His Gly Gly Thr 
80 

Thr Phe Ala Thr 
95 

Gin Val Glu Arg 
110 

Arg Leu Gly Val 



110 



115 



120 



125 



Asp Val Arg Glu 
130 

Arg Val Arg Gly 
145 

Leu Ala Arg Phe 



Thr Thr Val Gly 
180 

Ala Leu Phe Gly 
195 

Ser Gly Asn lie 
210 

lie Pro Leu Ser 
225 

Glu Met Ala His 



Leu lie Ala Glu 
260 

Arg Val Thr Glu 
275 

Ser Tyr Ser Ser 
290 

Asp Ala Ala Cys 
305 

Ala Thr Tyr Ser 



Ala Gly Thr Val 
340 

Tyr Arg Arg Glu 
355 

Asp Met His Val 
370 

Thr Glu Ser Ser 
385 

Gly lie Ala Ser 



Arg Arg His Ser 
420 

Leu Glu Glu Gly 
435 



Asn His Pro Val 
135 

Val Arg Phe Thr 
150 

Val Val Asp Ala 
165 

Gly Thr Arg Glu 



Tyr Phe Glu Asn 
200 

Leu Cys Val Ala 

215 

Glu Thr Leu Thr 
230 

Lys Val Gin Gly 
245 

Cys Pro Met lie 



Gly Asp Tyr Gly 
280 

Thr Ser Tyr Trp 
295 

Phe lie Asp Pro 
310 

Gly Leu Leu Ala 
325 

Asp Glu Asp Arg 



Phe Gly Val Phe 
360 

Asp Glu Ser Ser 
375 

Ala Pro Ala Met 
390 

Gly Glu Asp Ala 
405 

Arg Gin Thr Ala 



Gly Thr Gly Phe 
440 



Thr Glu Ala lie 
140 

Gin Asp Gly Gin 
155 

Ser Gly Asn Arg 
170 

Tyr Ser Pro Phe 
185 

Gly Arg Arg Leu 



Phe Gly Ser Gly 
220 

Ser Val Gly Ala 
235 

Asp Gin Glu Lys 
250 

Ala Asp Phe Leu 
265 

Gin lie Arg Val 



Arg Pro Gly Met 
300 

Val Phe Ser Ser 
315 

Ala Arg Ser lie 
330 

Ala Phe Thr Glu 
345 

His Asp Phe Leu 



Tyr Phe Trp Ala 
380 

Glu Ser Phe Thr 
395 

Leu Thr Gly Ser 
410 

Glu Leu Gly Gin 
425 

Leu Arg Gly Ser 



Ala Asp Asp Glu 



Thr Arg Thr Ala 
160 

Ser Thr Leu His 
175 

Phe Arg Asn Leu 
190 

Pro Ala Pro Asn 
205 

Trp Phe Trp Tyr 



Val Val Arg Arg 
240 

Ala Leu Phe Glu 
255 

Gly Asp Ala Thr 
270 

Arg Lys Asp Tyr 
285 

Cys Leu Val Gly 



Gly Val His Leu 
320 

Asn Ser Val Leu 
335 

Phe Glu Gin Arg 
350 

Val Ser Phe Tyr 
365 

Ala Arg Lys Val 



Glu Leu Val Gly 
400 

Thr Glu Leu Val 
415 

Ala Val Ala Gly 
430 

Ser Val Val Ala 
445 



Gin Ala Met Phe Glu Gly Ser Gin He Gin Ala Gly Ala He Leu Gly 
450 455 460 



Pro Glu Gly Thr Gin Glu Gin Pro Leu Phe Glu Gly Gly Leu Thr Pro 
465 470 475 480 

Ser Gly Asn Gly Leu Thr Trp Val Ala Ala Asp 
485 490 

<210> 22 
<211> 217 
<212> PRT 

<213> Actinoplanes sp . 
<400> 22 

Met Thr lie Arg Val Leu He Ala Asp Asp Gin Ala Met He Arg Ser 
15 10 15 

Gly Leu Arg Leu He Leu Glu Asp Glu Pro Asp He Glu Val Val Ala 
20 25 30 

Glu Ala Val Asp Gly Val Asp Ala Val Ala Gin Ala Arg Lys Leu Arg 
35 40 45 

Pro Asp Val Cys Leu Val Asp He Arg Met Pro Arg He Asp Gly He 
50 55 60 

Glu Val Thr Arg Ser Leu Ala Gly Pro Gly Val Val Asn Pro Leu Arg 
65 70 75 80 

Val He Val Val Thr Thr Phe Asp Ser Asp Glu Tyr Val Tyr Gly Ala 
85 90 95 

Leu Arg Gly Gly Ala Val Gly Phe He Leu Lys Asp Ala Gly Pro Thr 
100 105 110 

Leu Leu Val Glu Ala Val Arg Ala Ala His Lys Gly Asp Ala Leu Val 
115 120 125 

Ser Pro Ser Val Thr Val Arg Leu Leu Asn His Leu Asn Ala Ser Ala 
130 135 140 

Ala Pro Ala Gly Ser Glu Pro He Pro Leu Ser Asp Arg Glu Leu Glu 
145 150 155 150 

Val Ala Arg Ala He Ala Arg Gly Arg Thr Asn Gin Glu He Ala Ala 
165 170 175 

Asp Leu Phe He Ser Leu Ser Thr Val Lys Gly His Ala Ser Thr He 
180 185 190 

Gin Ser Lys Leu Gly Val Arg Asn Arg Val Gly Val Ala Ala Trp Ala 
195 200 205 

Trp Glu Asn Arg Leu Val Glu Gly Ser 
210 215 



<210> 23 
.<211> 403 



<212> PRT 

<213> Actinoplanes sp . 
<400> 23 

Met Asn He Ala Ala Ala Thr Gly Pro Ala Ala Gly Asp Gly Ala Gly 
15 10 15 

He Arg Thr Leu Gly Ser Val Arg Thr Ala Asp Arg Thr Thr Thr Met 
20 25 30 

Val Ala Asp Ala Gly Leu Ala Val Leu Phe Val Ala Ala Val Val Val 
35 40 45 

Glu Ala Val Ala Val Ala Gin Ser Trp Gly Leu Ala Tyr Trp Leu He 
50 55 60 

Gly Gly Ala Ala Ala Thr Leu Val Cys Leu Leu Ala Leu He Arg Arg 
65 70 75 80 

Arg Gly Pro Val Pro Cys Ala Ala Ala Gly Leu Thr He Ala Ala Gly 
85 90 95 

Ala Val Val Thr Ala Ala Val Leu His Met Pro Ala Glu Pro Gly Pro 
100 105 110 

Ala Met Ala Leu Ala Leu Ala Val Leu Thr Gly Ser Ala Val Arg Ala 
115 120 125 

Ala Pro Thr He Pro Ala Phe Ala Val Gly Gly Ala Ala Leu Gly Val 
130 135 140 

Val Ala Leu Ser Gin Val Ala Ala Ala Thr Trp Asp Ala Gly Pro Ala 
145 150 155 160 

Pro Val Thr Trp Leu Asn He Leu Thr Trp Leu Gly Gly Thr Ala Thr 
165 170 175 

Gly Leu Ser Leu Arg Thr Val Asp Gly Arg Ala Arg Ala Asn Ala Glu 
180 185 190 

Arg He Arg Gin Glu Glu Arg Leu Glu Leu Ala Arg Glu Leu His Asp 
195 200 205 

val Val Ala His His He Thr Gly Met He Leu Gin Thr Gin Ala Ala 
210 215 220 

Gin Val Leu Ala Arg Arg Asp Ala Gly Arg Val Pro Glu Arg Leu Ala 
225 230 235 240 

Val He Glu Thr Ala Gly Thr Glu Ala Leu Ala Ala Met Arg Arg Val 
245 250 255 

Val Gly Leu Leu Arg Asp Ala Asp Asp Gly Pro Pro Ser Ala Pro Glu 
260 265 270 

Pro Glu Glu Leu Ser Thr Leu Val Glu Arg Phe Ser Arg Gin Gly Gly 
275 280 285 



Pro Val Arg Leu Thr Thr Pro Asp Gly Met Lys Gin Trp Pro He Glu 
290 295 300 



Val Thr Thr Thr Val 
305 

Ala Arg His Ala Pro 
325 

Glu Gin Ala Asp Glu 
340 

Ala Pro Pro Arg Leu 
355 

Arg Glu Arg Val Glu 
370 

Pro Gly Gly Gly Trp 
385 

Glu Arg Arg 



Tyr Arg lie Val Arg Glu 
310 315 

His Ala Pro Asn Val Thr 
330 

lie Arg Val Glu Val Thr 
345 

His His Arg Gly Gly Tyr 
360 

Ser Leu Gly Gly Thr 
375 

Ser Val Ala Ala Thr 
390 



Ala Leu Thr Asn Val 
320 

Val Thr Val Thr Val 
335 

Asn Asp Ala Ala Ala 
350 

Gly Leu Val Gly Met 
365 

Leu Ser Thr Gly Pro Arg 
380 

Leu Pro Asn Pro Pro Arg 
395 400 



<210> 24 
<211> 309 
<212> PRT 

<213> Actinoplanes sp . 
<400> 24 

Met Lys Ala Met Ser His Glu Arg Ser Thr Pro Val Leu Gin Ala Glu 
15 10 15 

Gly Leu Thr Lys Arg Tyr Gly Arg Arg Arg Ala Leu Thr Asp Cys Thr 
20 25 30 

Leu Ser Val Pro Ser Gly Arg Val lie Ala Leu Val Gly Pro Arg Gly 
35 40 45 

Ser Gly Lys Ser Thr Leu Leu Gin Leu Cys Cys Gly Met Val Ala Pro 
50 55 60 

Ser Arg Gly Arg lie Arg Val Leu Gly Glu Arg Pro Asp Ala Gly Ala 
65 70 75 80 

Ala His Leu Ala Arg Val Gly Tyr Val Pro Arg Glu Pro Ala Val Tyr 
85 90 95 

Gly Ser Phe Thr Val Glu Asp His Leu Thr Met Gly Ala Arg Leu Asn 
100 105 110 

Pro Arg Trp Asp Arg Arg Leu Ala Asp Arg Arg lie Ala Ser Ala Gly 
115 120 125 

lie Pro Arg Thr Arg Arg Ala Asp Arg Leu Ser Ala Gly Gin Arg Ala 
130 135 140 

Glu Leu Ala Leu Thr Leu Ala Gly Gly Lys Arg Pro Glu Leu Leu Val 
145 150 155 160 



Leu Asp Glu Pro Gly Ala Val Leu Asp Ala Pro Ala Arg Ala Ser Phe 
165 170 175 



Leu Arg Gly Val Leu Asp Phe Val Ala Glu He Asp Ala Ser Val Leu 
180 185 190 

He Ser Gly His Pro Ser Gly Glu Val Glu Arg Leu Cys Asp His Leu 
195 200 205 

He Val Leu Ser Asp Ser Arg Val Leu Val Ala Gly Asp Val Arg Asp 
210 215 220 

Leu Leu Ala Arg His His Arg He He Ala Pro Arg Gly Glu Leu Asp 
225 230 235 240 

Arg Leu Pro Pro Gly Met Glu Pro He Trp Val Glu Asp Phe Gly Ser 
245 250 255 

Tyr Ser Gly Gly Val Val Arg Ala Glu Val Asp Leu Pro Arg Arg Pro 
260 265 270 

Trp Thr Val Glu Arg Val Glu Leu Glu Glu Leu Val Leu Ser Tyr Leu 
275 280 285 

Ser Arg Ala Ser Gly Ala Pro Ala Leu Ala Gly Cys Leu He Ala Pro 
290 295 300 

Gly Gin Pro Gly Ser 
305 

<210> 25 
<211> 553 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc„f eature 
<222> (1) . . (1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 25 

Val Thr Ala Ala Ala Leu Glu Lys 
1 5 

Gly Asn Pro Val Gly Tyr Ala Ala 
20 

Glu Met Leu Ala Glu Gly Glu Arg 
35 40 

Ala Glu Phe Val Pro Val Ala Tyr 
50 55 

Arg Leu Ala Glu Val Leu Arg Ala 
65 70 



Leu Leu Gly Asp Ala Arg Asp Pro 
10 15 

Val Leu Ala Ala Asp Glu Arg Gin 
25 30 

Leu Leu Asp Arg Tyr Gin Leu Asn 
45 

Gly Gly Arg Leu Ala Arg Ala Asp 
60 

Val Trp Arg Arg Asp Pro Cys Leu 

75 80 



Gly Leu Gly Tyr Gly Phe Ser Ser Leu He Ala Ser Val Asn Val Trp 
85 90 95 



Cys Ala Gly Asn Glu Glu Gin Arg Arg Arg Ala Ala Gly Leu Leu Leu 
100 105 110 



Ala Asn Lys Arg lie Ala Ala Ala Phe His Glu Leu Ala His Gly Thr 
115 120 125 

Asp Phe Ser Ala Ala Glu Cys Ala Ala Arg Pro Ala Gly Gly Gly Trp 
130 135 140 

Val Leu Ser Gly His Lys Glu lie Val Thr Asn Leu Arg Arg Ala Glu 
145 150 155 160 

Ala Met Val Leu Phe Ala Arg Thr Gly Glu Ala Arg Gly Ser Arg Ser 
165 170 175 

His Ser Gin Phe Leu Leu Val Arg Asp Glu Leu Pro Ala Ala Arg Ala 
180 185 190 

Val Asp Arg Pro Arg Tyr Pro Gly Ser Gly Met Arg Gly lie Asp Leu 
195 200 205 

Gly Gly Leu Val Phe Asp Asp Cys Pro Val Pro Ser Ser Ala Leu Leu 
210 215 220 

Gly Glu Gin Gly His Gly lie Glu Val Ala Leu Arg Ala Tyr Gin Val 
225 230 235 240 

Thr Arg Met Val Ser Pro Ala Leu Leu Val Gly Pro Leu Asp Ser Ala 
245 250 255 

Val Arg Leu Ala Thr Glu Met Ala Met Glu Arg Arg Leu Tyr Gly Ala 
260 265 270 

Ala Val Ala Asp Leu Pro Tyr Val Arg Thr Thr lie Ala Arg Ala Tyr 
275 280 ' 285 

Ala Ala Leu Leu Thr Val Asp Val Phe Ser Gly Val Gly Leu Arg Ala 
290 295 300 

Leu His Leu Leu Pro Glu Ala Thr Ala Gly Tyr Ala Pro Ala Val Lys 
305 310 315 320 

Tyr Leu Thr Ala Gin lie Val Leu Asp Ala lie Asp Asp Leu Arg Ser 
325 330 335 

Val Leu Gly Ala Gin Gly Tyr Leu Arg Gin Gly Pro Tyr Ala Met Phe 
340 345 350 

Gin Lys Leu Val Arg Asp Ala Ala Pro Ala Ser Phe Ala His Val Ser 
355 360 365 

Arg Ala Ala Cys Leu Val Met Leu Leu Pro His Leu Pro Arg Leu Ala 
370 375 380 

Arg Arg Ser Trp Thr Ala Glu Glu Pro Pro Pro Asp Asn Val Phe Thr 
385 390 395 400 

Leu Gly Gly Glu Leu Ser Pro Leu Asp Phe Ser Arg Leu Val Ser Gly 
405 410 415 



Met Arg Gly Asp Pro Leu Ala Gly Val Leu His Asp Ser Trp His Asp 



420 



425 



430 



Glu Gly Pro Val Gly Arg Phe Ala Glu Arg Phe His Arg Glu Leu Thr 
435 440 445 

Gly Leu Arg Asp Ala Cys Arg Glu Leu Gly Pro Ala Asp lie Thr lie 
450 455 460 

Asp Ala Asn Pro Ala Ala Phe Ala Leu Ala Asp Arg Tyr Thr Val Leu 
465 470 475 480 

Leu Ala Ala Ala Cys Ala Leu Gly Val Trp Arg Ala Gly Gly Arg Leu 
485 490 495 

His Arg Pro Ala Leu Leu Ala Val Leu Asp Gly Leu Ala Gly Arg Leu 
500 505 510 

Gly Gly Glu Ala Val Leu Ser Val Ala Glu Arg Glu His Val Glu His 
515 520 525 

Gin Leu Phe Glu Met Ala Ala Asp Arg Val Arg Thr Ser Arg Leu Leu 
530 535 540 



Asp Leu Ser Ala Arg 


Gin 


545 




550 


<210> 


26 




<211> 


585 




<212> 


PRT 




<213> 


Act inop lanes 


sp. 


<400> 


26 





Met Thr Val Arg Pro Leu Ala Pro Pro Ala Glu Val Arg Leu Asp Asp 
15 10 15 

Leu Leu Gly Pro Glu Asp Ala Trp Asp Ala Glu Thr Ala Ala Arg Asp 
20 25 30 

lie Ala Glu Glu Phe Pro Ala Arg Leu His Asp Arg Leu Asn Ser Phe 
35 40 45 

Gly Leu Gin Ser Trp Tyr Val Pro Pro Glu Trp Gly Gly Ala Pro Gly 
50 55 60 

Asp His Glu Arg Leu Leu His Leu Trp Arg Ala Val Ala Arg Arg Asp 
65 70 75 80 

Leu Ser Ala Ala Val Ala His Gly Lys Thr Tyr Leu Gly Ser Ala Pro 
85 90 95 

Val Trp Leu Ala Gly Asp Asp Gly Gin Arg Ala Thr Leu Ala Ala Ala 
100 105 110 

Val Leu Ala Gly Thr Pro Val Ala Trp Ala Leu Ser Glu Pro Asp His 
115 120 125 

Gly Ala Asp Leu Leu His Gly Thr Thr Thr Ala Leu Pro His Asp Ala 
130 135 . 140 

Gly Tyr Arg Leu Arg Gly Leu Lys Trp Pro lie Asn Asn Ala Thr Arg 



117 



145 



150 



155 



160 



Ala Arg Tyr Leu 



Arg Gly Gin Ser 
180 

Thr Trp Leu Pro 
195 

Asp lie Ser Gly 
210 

Leu Leu Gly Arg 
225 

Gin Leu Thr Arg 



Arg Ala Leu Arg 
260 

Arg Arg Pro Leu 
275 

Cys Ala Ala Leu 
290 

Arg Ser Val His 
305 

Val Lys Ala Tyr 



Ala Glu Leu Leu 
340 

Gly Met Phe Pro 
355 

Asp Gly Ser Thr 

370 

Arg Leu Ala Ala 
385 

Ala Glu Ala Ser 



Ala Leu Thr Leu 
420 

Pro Ala Leu Ala 
435 

Ala Arg His Ala 
450 

Gin Met Thr Glu 
465 



Thr Val Leu Ala 
165 

Leu Phe Leu Val 



Arg Pro Lys Val 
200 

lie Ala Phe Glu 
215 

Ala Gly Ser Gly 
230 

Thr Met Cys Ala 
245 

Leu Thr Ala Arg 



Leu Asp Arg Gly 
280 

Leu Ala Ala Ala 
295 

Ser Leu Thr Ala 
310 

Val Pro Thr Val 
325 

Gly Ser Arg Ser 



Lys Leu Val Arg 
360 

Pro Val Val Arg 
375 

Gly Phe Ala Ala 
390 

Ala Ala Gly Gin 
405 

Leu Ser Arg His 



Val Ser Ala Ala 
440 

Ala Ala Leu Ala 
455 

Leu Gly Pro Ser 
470 



Arg Thr Gly Arg 
170 

Asp Lys Glu Ala 
185 

Ala Thr His Gly 



Asp Ala Gly Leu 
220 

Leu Glu Thr Val 
235 

Gly Leu Ser Leu 
250 

Phe Val Ala Gin 
265 

His Pro Ala Gly 



Glu Ala Thr Ala 
300 

Glu Met Ser Val 
315 

Val Asp Arg Val 
330 

Phe Leu Arg Asp 
345 

Asp His His Val 



Thr Ala Leu Ala 
380 

Gly Ala Val Ser 
395 

Pro Pro Pro Pro 
410 

Gly Cys Ser Val 
425 

Val Arg Gly Gly 



Gly Glu Ala Arg 
460 

Ala Arg Pro Ser 
475 



Ala Gly Asp Ala 
175 

Leu Ala Pro Gly 
190 

Val Arg Gly lie 
205 

Pro Gly Thr Ala 



Leu Arg Ser Leu 
240 

Gly Ala Gly Asp 
255 

Arg Met lie Met 
270 

lie Leu Ala Arg 
285 

Val Val Gly Thr 



Thr Ser Ala lie 
320 

Leu Arg Glu Leu 
335 

Glu Tyr Glu His 
350 

Val Ala Val Phe 
365 

His Gin Phe Pro 



Ala Glu Gly Leu 
400 

Leu Asp Arg Gly 
415 

Val Gin Ala Leu 
430 

Pro Ala Gly Leu 
445 

Arg lie Cys Gly 



Met Val Gly His 
480 



Glu Leu Ala Ala Ala Tyr Glu Trp Cys Tyr Ala Gly Ala Ala Cys L.eu 
485 490 495 



Leu Leu Trp Thr Ser Ala Glu Gly Arg His Thr Ala Asp Pro Leu Trp 
500 505 510 

Ala Asp Gly Leu Trp Val Leu Ala Ala Leu Arg Ala Val Arg Arg Glu 
515 520 525 

Leu Ala Arg Val Leu Arg Ala Pro Ala Pro Asp Pro Gly Pro His Asp 
530 535 540 

Asp Gly Ala Asp Arg Leu Leu Ala Ala Arg Val Ala Ala Ala Ala Arg 
545 550 555 560 

Thr Gly Gl*u Pro Val Thr Pro Phe Gly Thr Ala Leu Arg Pro Pro Ala 
565 570 575 

Gly Thr Val Arg Ala Glu Asp Gly Arg 
580 585 

<210> 27 
<211> 587 
<212> PRT 

<213> Actinoplanes sp. 
<400> 27 

Met Val lie Asp Ala Ala Thr Gin Pro Thr Val Pro Asp Ala Phe Arg 
15 10 15 

Ala Gin Ala lie Ala Arg Pro Gly Glu Pro Ala Leu Val Val Leu Pro 
20 25 30 

Gly Asp Pro Asp Ala Glu Pro Val Thr Leu Thr Tyr Ala Glu Leu Asp 
35 40 45 

Arg Arg Ala Ala Ala Arg Ala Ala Trp Leu Ala Ala Arg Phe Pro Ala 
50 55 60 

Gly Glu Arg lie Leu lie Ala Leu Pro Thr Gly Ala Glu Phe Val Glu 
65 70 75 80 

Leu Tyr Leu Ala Cys Leu Tyr Ala Gly Leu Val Ala Val Pro Ala Pro 
85 90 95 

Pro Pro Gly Gly Ser Ser Gly Ala Ser Glu Arg Thr Val Gly lie Ala 
100 105 110 

Ala Asp Cys Ser Pro Ala Leu Ala Val Val Asn Ala Asp Asp Ala Ala 
115 120 125 

Pro Leu Thr Ala Val Leu Arg Glu Arg Gly Leu Ser Gly Leu Pro Val 
130 135 140 

Gly Ala Leu Pro Pro Leu Ala Ala Glu Ala lie Arg Pro Pro Arg Gly 
145 150 155 160 



Pro Arg Pro Asp Ser Leu Ala Val Leu Gin Tyr Ser Ser Gly Ser Thr 
165 170 175 



Gly Ser Pro Lys 
180 

Leu Arg Ala Phe 
195 

Gly Ser Trp Leu 
210 

Thr Ala Gly Leu 
225 

Ala Phe Val Arg 



Arg Val Thr lie 
260 

Arg Ala Val Arg 
275 

Arg Thr Leu Tyr 
290 

Ala Phe Thr Glu 
305 

Asn Pro Cys Tyr 



Val Phe Glu Ala 
340 

Glu Asp Ala Ala 
355 

Arg Glu lie Pro 
370 

Val Asp Pro Asp 
385 

lie Trp Leu Arg 



Glu Leu Asn Pro 
420 

Asp Gly Gly Trp 
435 

Glu Leu Phe Leu 
450 

Arg Asn Leu Ala 
465 

Asp Ala Val Asp 



Gly Val Met Leu 



Asp Arg Ser Ser 

200 

Pro Leu His His 
215 

Leu Asn Gly Ala 
230 

Arg Pro Ala Asp 
245 

Ser Ala Ala Pro 



Asp Glu Gin lie 
280 

Asn Gly Ser Glu 
295 

Arg Phe Ala Pro 
310 

Gly Met Ala Glu 
325 

Pro Ala Val Phe 



Ser Pro Ala Leu 
360 

Gly Val Gly Arg 
375 

Gly Leu Arg Pro 
390 

Gly Pro Gly Ala 
405 

Gly lie Phe Asp 



Val Arg Thr Gly 
440 

Thr Gly Arg Leu 
455 

Pro His Asp Leu 
470 

His Gin lie Gly 
485 



Ser His Arg Ala 
185 

Gly His Asn Ser 



Asp Met Gly Leu 
220 

Gly Val Val Leu 
235 

Trp Leu Arg Met 
250 

Asn Phe Ala Tyr 
265 

Ala Gly Leu Asp 



Pro Val Asn Pro 
300 

Phe Gly Leu His 
315 

Phe Thr Ala Tyr 
330 

Leu Pro Ala Asp 
345 

Arg Pro Ala Asp 



Val Pro Asp Phe 
380 

Leu Pro Glu Gly 
395 

Gly Ala Gly Tyr 
410 

Ala Arg Pro Ala 
425 

Asp Leu Gly Ala 



Lys Glu Leu Leu 
460 

Glu Arg Glu Ala 
475 

Ala Ala Phe Gly 
490 



Val Leu Ala Asn 
190 

Asp Asp Val Phe 
205 

Phe Ala Met Leu 



Met Ser Pro Thr 
240 

Met Asp Arg Tyr 
255 

Asp Leu Cys Val 
270 

Leu Ser Arg lie 
285 

Ala Thr Val Arg 



Thr His Ala Val 
320 

Val Ser Thr Lys 
335 

Pro Arg Ala Leu 
350 

Pro Ala Ala Ala 
365 

Glu Val Leu lie 



Arg Val Gly Glu 
400 

Trp Gly Arg Thr 
415 

Gly Asp Gly Gin 
430 

Leu Thr Gly Gly 
445 

lie Val His Gly 



Arg Ala Ala His 
480 

Val Pro Ala Pro 
495 



Asp Glu Arg lie Val Leu Val Gin Glu Val His Pro Arg Thr Pro Leu 
500 505 510 



Asp Glu Leu Pro Arg Val Ala Ser Ala Val Ser Arg Arg Leu Thr Val 
515 520 525 

Ser Phe Gly Val Pro Val Arg Asn Val Leu Leu Val Arg Arg Gly Thr 
530 535 540 

Val Arg Arg Thr Thr Ser Gly Lys lie Arg Arg Thr Ala Val Arg Glu 
545 550 555 560 

Arg Phe Leu Ala Gly Gly lie Thr Ala Leu His Ala Glu Leu Glu Pro 
565 - 570 575 

Ala Leu Arg Pro Val Gin Ala Gly Ala Gly Arg 
580 585 

<210> 28 
<211> 75 
<212> PRT 

<213> Actinoplanes sp . 
<220> 

<221> inisc_f eature 
<222> (1) . . (1) 

<223> V represents a non-standard initiator codon. Ti is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 28 

Val Pro Asn Pro Phe Glu Asp Pro Asp Ala Asn Tyr Leu Val Leu Val 
15 10 15 

Asn Asp Glu Gly Gin His Ser Leu Trp Pro Val Phe Ala Asp Val Pro 
20 25 30 

Asp Gly Trp Thr Thr Val Phe Gly Glu Ala Gly Arg Gin Asp Cys Leu 
35 40 45 

Asp Tyr lie Glu Lys Ser Trp Thr Asp Met Arg Pro Lys Ser Leu lie 
50 55 60 

Ala Ala Met Glu Lys Gin Lys Gin Pro Gin Ser 
65 70 75 

<210> 29 
<211> 94 
<212> PRT 

<213> Actinoplanes sp, 
<220> 

<221> inisc_f eature 
<222> (1) . . (1) 

<223> V is a non-standard initiator codon. It is expected that the bio 
synthesized protein will have a f ormylmethionine residue at this 
position 



<400> 29 



U \ 



Val Ala Pro Gly Ala Pro Pro Ala Glu His Gly Glu Ala Val Pro Glu 
15 10 15 

Ala Asp lie Pro Val Leu Arg Asn Arg lie Asp Glu lie Asp Ala Ala 
20 25 30 

lie Met Arg Leu Trp Gin Glu Arg Ala Ser lie Ser Gin Lys lie Gly 
35 40 45 

Ser lie Arg Leu Ala Ser Gly Gly Thr Arg Val Val Leu Ser Arg Glu 
50 55 60 

Gin Glu Val lie Gin Arg Phe Arg Ala Ala Leu Gly Glu Asp Gly Thr 
65 70 75 80 

Thr lie Ala Leu Met Leu Leu Arg Ala Gly Arg Gly Pro Leu 
85 90 

<210> 30 
<211> 619 
<212> PRT 

<213> Actinoplanes sp . 
<22 0> 

<221> misc„f eature 
<222> (1),.(1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 30 

Val Asp Val Pro Arg Val Arg Pro Pro Gly Ala Ala Pro Ala Pro Arg 
15 10 15 

Arg Arg Arg Trp Arg Phe Trp Gin Ser Pro Asp Gly Gin Pro Ala Trp 
20 25 30 

Ala Arg Pro Ala Leu Leu Gly lie Ala Ala Leu Ala Ala Val Leu Tyr 
35 40 45 

Thr Ala Asn Leu Ala Arg Ser Gly Tyr Pro Met Tyr Tyr Ala Val Ala 
50 55 60 

Val Lys Ser Met Ser Val Ser Trp Pro Ala Phe Trp Thr Gly Ala Phe 
65 70 75 80 

Asp Pro Ala Ala Ser lie Thr lie Asp Lys Leu Ala Gly Ala Phe Val 
85 90 95 

Pro Gin Ala Leu Ser Ala Arg Val Phe Gly Phe His Gin Trp Ser Leu 
100 105 110 

Ala Leu Pro Gin Ala Val Glu Gly Val lie Ala Val Leu Val Leu Tyr 
115 120 125 

Arg Ala Val Arg Arg Trp His Gly Pro Gly Ala Gly Leu Ala Ala Ala 
130 135 140 



122 



Gly Leu Phe Ala 
145 

Met Glu Asp Gly 



Phe Gly Ala Ala 
180 

Gly Ala Trp lie 
195 

Leu Val Leu Pro 
210 

Arg Ala Arg Ala 
225 

Leu Ala Val Ser 



Ser His Arg Pro 
260 

Met Val Phe Gly 
275 

Gly Ala Leu Thr 
290 

Ser Trp Thr Ala 
305 

Trp Tyr Pro Leu 



Arg Thr Ala Arg 
340 

Ala Ala Val Val 
355 

Ala Val Leu Ala 
370 

Gin Leu Trp Arg 
385 

Ala Val Val Val 



Tyr Pro Ser Phe 
420 

Val Leu Ala Val 
435 

Ala Val Val Val 
450 



Thr Thr Pro lie 
150 

Ala Leu Thr Leu 
165 

Val Thr Arg Gly 



Gly Leu Gly Phe 
200 

Ala Leu Val Val 
215 

Arg Val Val His 
230 

Leu Leu Trp Val 
245 

Trp Ala Asp Gly 



Tyr Asn Gly Phe 
280 

Thr Gly Phe Thr 
295 

Leu Ala Ala Asp 
310 

Ala Leu Thr Gly 
325 

Ala Gly Leu Leu 



Leu Ser Arg lie 
360 

Pro Pro Leu Ala 
375 

Thr His Arg Asp 
390 

Val Gin Ala Gly 
405 

Leu Ala Gly Leu 



Val Val Leu Ala 
440 

Val Val Ala Gly 
455 



Val Ser Ser Met 
155 

Cys Leu Val Leu 
170 

Ser Pro Ala Arg 
185 

Gin Ala Lys Met 



Thr Tyr Leu Ala 
220 

Val Ala Ala Ala 
235 

Leu Ala Leu Thr 
250 

Thr Thr Ser Gly 
265 

Asp Arg Ala Gly 



Asp Gly Gly Ala 
300 

Arg Leu Ala Thr 
315 

Leu Leu Leu Gly 
330 

Phe Trp Gly Leu 
'345 

Thr lie Gin His 



Ala Leu Ala Ala 
380 

Gly Thr Ala Pro 
395 

Trp Thr Leu Trp 
410 

Thr Trp Thr Ala 
425 

Ala Arg Pro Thr 



Leu Leu Ala Val 
460 



Phe Gly His Ser 
160 

Ala Ala Asp Ala 
175 

Leu Ala Leu Ala 
190 

Met Gin Ala Trp 
205 

Gly Ala Pro Val 



Val Ala Ala Thr 
240 

Leu Leu Pro Gly 
255 

Asn Ala Phe Ala 

270 

lie His Val Pro 
285 

Ala Ala Gly Gly 



Gin lie Gly Trp 
320 

Leu Ala Arg Trp 
335 

Trp Leu Leu Thr 
350 

Asn Ala Tyr Leu 
365 

Ala Gly Ala Val 



Trp Leu Leu Pro 
400 

Leu Ala Thr Arg 
415 

Pro lie Ala Ala 
430 

Ala Arg Arg Pro 
445 

Pro Val Ala Trp 



Gly Ala Ser Val Leu Asn Pro Arg Tyr Ala Gly Thr Ser Phe Glu Ala 
465 470 475 480 

Gly Ala Gly Pro Ser Gly Pro Val Gly Val Arg Leu Asp Asp Asp Thr 
485 490 495 

Thr Asp Arg Leu Thr Pro Gly Leu Arg Arg Leu Asp Asp Tyr Leu Ala 
500 505 510 

Ala His Arg Asp Gly Arg Thr Tyr Leu Ala Ala Thr Ser Ser Trp Arg 
515 520 525 

Thr Ala Gly Arg Leu lie Val Pro Thr Gly His Ser Tyr Leu Pro Leu 
530 535 540 

Gly Gly Phe Ser Gly Ala Ala Pro Phe Pro Ser Leu Ala Gly Val Gin 
545 550 555 560 

Arg Leu Val Arg Asp Gly Glu Leu Arg Tyr Phe Val Leu Gly Gly Pro 
565 570 575 

Glu Gly Leu Gly Gly Glu Ala Thr Glu Ala Tyr Arg He Thr Gly Trp 
580 585 590 

Val Leu Glu Thr Cys Ala Thr Val Pro Pro Ala Glu His Gly Ala Asp 
595 600 605 

Pro Asp Leu Thr Val Leu Arg Cys Asp Lys Pro 
610 615 

<210> 31 
<211> 355 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc_feature 
<222> (1) . . (1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue a 
t this position 



<400> 31 

Val Asp Asn Gly Thr Phe Thr Asp 
1 5 

Ala Val Ala Asp Val Glu Ser Ala 
20 

Gly Phe Ser Val Tyr Gly Gly Thr 
35 40 

Arg Val Ala Leu Gly Arg Asp Asp 

50 55 

Pro Gly Gly Asp His Pro Ala Met 
65 70 



Leu Arg He Asp His He Glu Phe 
10 15 

Ser Ala Pro Phe Thr Glu Gly Tyr 
25 30 

Gly Asp Ala His Ala Pro Val Arg 
45 

He Arg Leu Val Leu Thr Ala Ala 

60 

Ala Tyr Val Glu Gin His Gly Asp 
75 80 



Gly Val Ser Ala He Ala Leu Ser Thr Arg Asp Ala His Ala Ala Phe 



85 90 95 

Thr Glu Ala Val Arg Arg Gly Ala Val Gly Val Ser Ala Pro Val Thr 
100 105 110 

Gly Asn Gly Val Thr Val Ala Thr lie Arg Gly Phe Gly Asp Val Leu 
115 120 125 

His Thr Phe Val Glu Arg Ala Pro Gly Ala Asp Pro Arg Thr Leu Pro 
130 135 140 

Gly Leu Glu Leu Arg Arg Pro Ser Pro Thr Arg Phe Asp Ser Gly Leu 
145 150 155 160 

Gin Ala lie Asp His lie Ala Val Cys Leu Glu Pro Gly Thr Leu Asp 
165 170 175 

Pro Thr Val Asp Phe Tyr Arg Asp Val Leu Asp Phe Glu Met lie Phe 
180 185 190 

Glu Glu Arg' lie Leu Val Gly Arg Gin Ala Met Asp Ser Lys Val Val 
195 200 205 

Gin Ser Arg Ser Gly Gly Val Thr Leu Thr Leu lie Glu Pro Asp Thr 
210 215 220 

Ser Leu Glu Gin Gly Gin lie Asp Thr Phe Leu Lys Asn His Gly Gly 
225 230 235 240 

Pro Gly Val Gin His Leu Ala Phe lie Thr Asp Asp Val Leu Arg Ser 
245 250 255 

Val Gly Arg Met Ser Glu His Gly Val Glu Phe Leu His Thr Pro Asp 
260 265 270 

Ser Tyr Tyr Gly Arg Leu Pro Gly Arg lie Pro Gin Ala Gly His Pro 
275 280 285 

lie Gin Ala Leu Arg Asp Leu Asn Val Leu Val Asp Gin Asp His Asp 
290 295 300 

Gly Gin Leu Phe Gin lie Phe Thr Lys Ser Val His Pro Arg Gly Thr 
305 310 315 320 

He Phe Met Glu Val He Glu Arg Met Gly Ala Arg Ser Phe Gly Ser 
325 330 335 

Gly Asn He Lys Ala Leu Tyr Glu Ala Val Glu Leu Asp Met Ser Lys 
340 345 350 

Gin Ser Ala 
355 

<210> 32 

<211> 429 

<212> PRT 

<213> Actinoplanes sp, 

<400> 32 



Met Glu Ser Pro Ala Thr His Ala Glu Leu Val He Gly Thr Val Leu 



15 10 15 

Leu Asp He Ala Leu Val Leu Ala Ala Gly Ala Leu Leu Gly Arg Trp 
20 25 30 

Val Arg Arg Leu Arg Gin Pro Ala Val He Gly Glu He Leu Ala Gly 
35 40 45 

He Ala Leu Gly Pro Ser Leu Leu Gly Leu Leu Pro Gly Asn Pro Thr 
50 55 60 

Ala Trp Leu Phe Pro Ala Glu Ala Arg Pro Tyr Leu Ser Ala Val Ala 
65 70 75 ^ 80 

Gin He Gly Leu Ala Leu Phe Thr Phe Leu He Gly Trp Glu Phe Asn 
85 90 95 

Pro Ala Thr Leu Ala Arg His Arg Gly Thr Ala Ala Ala Val Ser He 
100 105 110 

Gly Ser He Ala Val Ser Phe Gly Leu Gly He Ala Leu Ala Thr Val 
115 120 125 

Leu His Pro Arg His Asp Thr Thr Gly Gly Gly Lys Val Gly Phe Thr 
130 135 140 

Glu Phe Ala Leu Phe Leu Gly Val Ala Met Ser He Thr Ala Phe Pro 
145 150 155 160 

Val Leu Ala Arg He Leu Ala Glu Arg Arg Leu Thr Gly Thr Arg Val 
165 170 175 

Gly Ser He Ala Leu Val Ser Ala Ala He Asp Asp Val Val Ala Trp 
180 185 190 

Cys Leu Leu Ala Leu Val Thr Ala He Ala Thr Ala Ser Gly Pro Val 
195 200 205 

Gin Leu Val Arg He Leu Ala Leu Leu Ala Val Phe Leu Val Val Leu 
210 215 220 

Val Thr Val Val Arg Pro Leu Leu Val Leu Leu Ala Arg Arg Pro Ser 
225 230 235 240 

Ala Ser Tyr Leu Leu Val Ala Val Val Ala Val Val Leu Leu Ser Ala 
245 250 255 

Tyr Ala Thr Thr Trp He Gly Leu His Ala He Phe Gly Ala Phe Cys 
260 . 265 270 

Ala Gly Leu Val Met Pro Arg Glu Pro Ala Ala Ala Leu Arg Glu Arg 
275 280 285 

Val Arg Gin Pro Leu Glu His Val Ser Val Val Leu Leu Pro Val Phe 
290 295 300 

Phe He Val Thr Gly Leu Gly Val Asp He Gly Ala Leu Thr Ala Ala 
305 310 315 320 

Asn He Leu Glu Leu Ala Ala He He Val He Ala Cys Ala Gly Lys 
325 330 335 



126 



Leu Ala Gly Ala lie Val Pro Ala Val Ser Leu Gly Met Ser Trp Arg 
340 345 350 



Asp Ala Arg Thr Leu Gly Leu Leu Val Asn Thr Arg Gly Leu Thr Glu 
355 360 365 

Leu Val Val Leu Asn Val Gly Leu Gin Leu Ala Val Leu Asp Gly Gin 
370 375 380 

Met Phe Thr Met Met Val Leu Met Ala Leu Val Thr Thr Ala Leu Ala 
385 390 395 400 

Gly Pro Leu lie Gly Ser Ala Arg Thr Pro Ala Ala Gly Ala Pro Ala 
405 410 415 

Gin Ala Leu Pro Ala Glu Pro Arg Thr Arg Arg Ala Ala 
420 425 

<210> 33 
<211> 189 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> inisc_f eature 
<222> (1) . . (1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 33 

Val Ser Asp Glu Ala Ala Val Pro Ser Pro Ala Arg Leu Leu Arg Asp 
15 10 15 

Phe Val Asn Thr Tyr Glu Pro Gin Val Asp Asp Glu Ser Leu Ser Thr 
20 25 30 

Pro Asp Ala Leu Arg Ala Trp Leu Ala Gly Glu Ser Leu Leu Ala Pro 
35 40 45 

Gly Ala Arg Val Arg Pro Ala Asp Leu Ala Arg Ala Val Ala Leu Arg 
50 55 60 

Glu Gly Leu Arg Gin Val Leu Leu Gly His Ala Gly His Pro Ala Asp 
65 70 75 80 

Pro Ala Ala Leu Arg Arg Leu Glu Glu lie Leu Ala Ala Val Pro Val 
85 90 95 

Arg Leu Ser Leu Ala Gly Gly Ala Pro Arg Leu Leu Pro Ala Gly Gly 
100 105 110 

Thr Pro Phe Asp Arg Ala Leu Ala Gly Leu lie Asp Ala Val Arg Gin 
115 120 125 



Cys Ala Glu Leu Gin Val Trp Thr Arg Leu Lys Val Cys Asp Arg Asp 
130 135 140 



Thr Cys Arg Trp Ala Tyr Tyr Asp Ala Ser Arg Asn Gin Ala Arg Arg 
145 150 155 160 



Trp Cys Ser Met Ala Gly Cys Gly Asn Tyr lie Lys Met Arg Arg Ala 
165 170 175 

Tyr Ala Ala Arg Arg Val Arg Gly Ser Ala Gly Ser Ala 
180 185 

<210> 34 
<211> 309 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> niisc__f eature 
<222> (1) . . (1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 34 

Val Ala Thr Thr Leu Arg Asp Val Ala Arg Leu Ala Arg Val Ser Val 
15 10 15 

Lys Thr Val Ser Asn Val Val Asn Asp His Pro His Val Ser Asp Asp 
20 25 30 

Val Arg Arg Arg Val Glu Thr Ala He Arg Gin Leu Gly Tyr Arg Pro 
35 40 45 

Asn Leu Val Ala Arg Ala Leu Arg Ser Gly Arg Gly Ser Gly Leu Leu 
50 55 60 

Ala Leu Ala Met Pro Gly Ala Gly Ala Pro Gin Ser Pro Ala Leu He 
65 70 75 80 

Glu Glu He He Arg Arg Ala Ala Pro Leu Gly Phe Arg Val Leu He 
85 90 95 

Glu Pro Leu Glu Ser Ser Arg Pro Arg Pro Pro Ala Pro Gly Val Asp 
100 105 110 

Ala Arg Leu Leu Asn Ala Glu Ala Pro Ala Pro Glu Leu Val Asp Ala 
115 120 125 

Gin Ala Ala Thr Gly Thr Pro Leu Val Leu Leu Thr Gly Thr Pro Asp 
130 135 140 

Pro Arg Tyr Asp Cys Val Gly Pro Asp Ala Ala Arg Ala Ala Glu Asp 
145 150 155 160 

Ala Val Asp His Leu Arg Arg Leu Gly Arg Arg Arg Val Ala Thr He 
165 170 175 

Gly Gly Ser Leu Ser Thr Gly Pro Ala Gly Ser Gly Ser Asp Phe Gly 
180 185 190 

Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly 



128 



195 



200 



205 



Ser Gly Ser Gly 
210 

Ser Gly Phe Gly 
225 

Leu Leu Gly His 



Arg Leu Ala Val 
260 

Val Pro Glu Asp 
275 

Arg Tyr Thr Arg 
290 



Ser Gly Ser Gly 
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